UPPER TAILS AND INDEPENDENCE POLYNOMIALS IN RANDOM GRAPHS


BHASWAR B. BHATTACHARYA, SHIRSHENDU GANGULY, EYAL LUBETZKY, AND YUFEI ZHAO 

Abstract. The upper tail problem in the Erdős-Rényi random graph $G \sim \mathcal{G}_{n,p}$ asks to estimate the
probability that the number of copies of a graph $H$ in $G$ exceeds its expectation by a factor $1+\delta$.
Chatterjee and Dembo (2014) showed that in the sparse regime of $p \to 0$ as $n \to \infty$ with $p \ge n^{-\alpha}$ for an
explicit $\alpha = \alpha_H > 0$, this problem reduces to a natural variational problem on weighted graphs, which
was thereafter asymptotically solved by two of the authors in the case where $H$ is a clique.

Here we extend the latter work to any fixed graph $H$ and determine a function $c_H(\delta)$ such that,
for $p$ as above and any fixed $\delta > 0$, the upper tail probability is $\exp[-(c_H(\delta)+o(1))\,n^2p^\Delta\log(1/p)]$,
where $\Delta$ is the maximum degree of $H$. As it turns out, the leading order constant in the large deviation
rate function, $c_H(\delta)$, is governed by the independence polynomial of $H$, defined as $P_H(x) = \sum_k i_H(k)x^k$
where $i_H(k)$ is the number of independent sets of size $k$ in $H$. For instance, if $H$ is a regular graph on $m$
vertices, then $c_H(\delta)$ is the minimum between $\frac12\delta^{2/m}$ and the unique positive solution of $P_H(x) = 1+\delta$.


1. Introduction 

1.1. The upper tail problem in the random graph. Let $\mathcal{G}_{n,p}$ be the Erdős-Rényi random graph
on $n$ vertices with edge probability $p$, and let $X_H$ be the number of copies of a fixed graph $H$ in it.
The upper tail problem for $X_H$ asks to estimate the large deviation rate function given by

\[ R_H(n,p,\delta) := -\log\mathbb{P}\big(X_H \ge (1+\delta)\,\mathbb{E}[X_H]\big) \quad\text{for fixed } \delta > 0, \]

a classical and extensively studied problem (cf. [6,11,12,15-18,24] and [1,14] and the references therein)
which already for the seemingly basic case of triangles ($H = K_3$) is highly nontrivial and still not fully
understood. It followed from works of Vu [24] and Kim and Vu [18] that^1

\[ n^2p^2 \lesssim R_{K_3}(n,p,\delta) \lesssim n^2p^2\log(1/p) \]

(the lower bound used the so-called “polynomial concentration” machinery, whereas the upper bound
follows, e.g., from the fact that an arbitrary set of $s \sim \delta^{1/3}np$ vertices can form a clique in $\mathcal{G}_{n,p}$ with
probability $p^{\binom{s}{2}} = p^{O(n^2p^2)}$, thus contributing $\binom{s}{3} \sim \frac16\delta n^3p^3 \sim \delta\,\mathbb{E}[X_{K_3}]$ extra triangles); the correct
order of the rate function was settled fairly recently by Chatterjee [6], and independently by DeMarco
and Kahn [12], proving that $R_{K_3}(n,p,\delta) \asymp n^2p^2\log(1/p)$ for $p \ge \frac{\log n}{n}$. This was later extended in [11]
to cliques ($H = K_k$ for $k \ge 3$), establishing that for $p \ge n^{-\frac{2}{k-1}+\varepsilon}$ with $\varepsilon > 0$ fixed^2,

\[ R_{K_k}(n,p,\delta) \asymp n^2p^{k-1}\log(1/p). \]

The methods of [6,11,12] did not allow recovering the exact asymptotics of this rate function, and
in particular, one could ask, e.g., whether $R_{K_3}(n,p,\delta) \sim c(\delta)\,n^2p^2\log(1/p)$ with $c(\delta) = \frac12\delta^{2/3}$, as the
aforementioned clique upper bound for it may suggest (recalling its probability was $p^{\binom{s}{2}}$ for $s \sim \delta^{1/3}np$).

Much progress has since been made on that front, propelled by the seminal work of Chatterjee and
Varadhan [9] that introduced a large deviation framework for $\mathcal{G}_{n,p}$ in the dense regime ($0 < p < 1$ fixed)
via the theory of graph limits (cf. [8,9,22] for more on the many questions still open in that regime).

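^1 We write $f \lesssim g$ to denote $f = O(g)$; $f \asymp g$ means $f = \Theta(g)$; $f \sim g$ means $f = (1+o(1))g$ and $f \ll g$ means $f = o(g)$.
^2 More precisely, DeMarco and Kahn [11] showed that $R_{K_k} \asymp \min\{n^2p^{k-1}\log(1/p),\ n^kp^{\binom{k}{2}}\}$ for $p \ge n^{-2/(k-1)}$.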

Figure 1. The leading order constant $c_H(\delta)$ for the upper tail rate function for $k$-cycles
vs. their independence polynomials $P_H(x)$. Zoomed-in regions show $P_H(c_H(\delta)) = 1+\delta$.


In the sparse regime ($p \to 0$), in the absence of graph limit tools, the understanding of large deviations
for a fixed graph $H$ (be it even a triangle) remained very limited until a recent breakthrough paper of
Chatterjee and Dembo [7] that reduced it to a natural variational problem in a certain range of $p$ (see
Definition 1.3 and Theorem 1.4 below). In [23], this variational problem was solved asymptotically for
triangles, thereby yielding the following conclusion: for fixed $\delta > 0$, if $n^{-1/42}\log n \le p = o(1)$, then

\[ R_{K_3}(n,p,\delta) \sim c(\delta)\,n^2p^2\log(1/p) \quad\text{where}\quad c(\delta) = \min\{\tfrac12\delta^{2/3},\ \tfrac13\delta\}, \tag{1.1} \]

and we see that the clique construction from above gives the correct leading order constant iff $\delta \ge 27/8$.
More generally, for every $k \ge 3$ there is an explicit $\alpha_k > 0$ so that, for fixed $\delta > 0$, if $n^{-\alpha_k} \le p = o(1)$,

\[ R_{K_k}(n,p,\delta) \sim c_k(\delta)\,n^2p^{k-1}\log(1/p) \quad\text{where}\quad c_k(\delta) = \min\{\tfrac12\delta^{2/k},\ \tfrac1k\delta\}. \tag{1.2} \]

For a general fixed graph $H$ with maximum degree $\Delta \ge 2$ (when $\Delta = 1$ the problem is nothing but the
large deviation in the binomial variable corresponding to the edge-count) the order of the rate function
was established up to a multiplicative $\log(1/p)$ factor by Janson, Oleszkiewicz, and Ruciński [15]; in
the range $p \ge n^{-1/\Delta}$ their estimate (which involves a complicated quantity $M^*_H(n,p)$) simplifies into

\[ n^2p^\Delta \lesssim R_H(n,p,\delta) \lesssim n^2p^\Delta\log(1/p) \]

(with constants depending on $H$ and on $\delta$). As a byproduct of the analysis of cliques in [23], it was
shown [23, Corollary 4.5] that there is some explicit $\alpha_H > 0$ so that, for fixed $\delta > 0$, if $n^{-\alpha_H} \le p = o(1)$,

\[ R_H(n,p,\delta) \asymp n^2p^\Delta\log(1/p), \]

yet those bounds were not sharp already for the 4-cycle $C_4$. Here we extend that work and determine the
precise asymptotics of $R_H(n,p,\delta)$ for any fixed graph $H$ in the above mentioned range $n^{-\alpha_H} \le p = o(1)$
(as currently needed in the framework of [7]). Solving the variational problem for a general $H$ requires
significant new ideas atop [23], and turns out to involve the independence polynomial $P_H(x)$ (see Fig. 1).
Definition 1.1 (Independence polynomial). The independence polynomial of $H$ is defined to be

\[ P_H(x) := \sum_{k} i_H(k)\,x^k, \tag{1.3} \]

where $i_H(k)$ is the number of $k$-element independent sets in $H$.
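
As a concrete illustration of Definition 1.1, the following minimal Python sketch computes the coefficients $i_H(k)$ by brute-force enumeration. The function names `independence_polynomial` and `cycle`, and the encoding of $H$ as an edge list on vertices $0,\ldots,n-1$, are illustrative choices of ours rather than anything from the paper, and the enumeration is only practical for small graphs.

```python
from itertools import combinations

def independence_polynomial(n, edges):
    """Coefficients [i_H(0), i_H(1), ...] of P_H for H on vertices 0..n-1."""
    edge_set = {frozenset(e) for e in edges}
    coeffs = []
    for k in range(n + 1):
        # Count the k-subsets of vertices that span no edge of H.
        count = sum(
            1
            for subset in combinations(range(n), k)
            if all(frozenset(pair) not in edge_set
                   for pair in combinations(subset, 2))
        )
        coeffs.append(count)
    # Drop trailing zeros (sizes above the independence number).
    while len(coeffs) > 1 and coeffs[-1] == 0:
        coeffs.pop()
    return coeffs

def cycle(k):
    """Edge list of the k-cycle C_k on vertices 0..k-1 (hypothetical helper)."""
    return [(i, (i + 1) % k) for i in range(k)]

if __name__ == "__main__":
    print(independence_polynomial(4, cycle(4)))  # coefficients of P_{C_4}
    print(independence_polynomial(5, cycle(5)))  # coefficients of P_{C_5}
```

The coefficient list is ordered by increasing degree, so entry `k` is $i_H(k)$.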

Definition 1.2 (Inducing on maximum degrees). For a graph $H$ with maximum degree $\Delta$, let $H^*$ be
the induced subgraph of $H$ on all vertices whose degree in $H$ is $\Delta$. (Note that $H^* = H$ if $H$ is regular.)

Roots of independence polynomials were studied in various contexts (cf. [4,5,10] and their references);
here, the unique positive $x$ such that $P_{H^*}(x) = 1+\delta$ will, perhaps surprisingly, give the leading order
constant (possibly capped at some maximum value if $H$ happens to be regular) of $R_H(n,p,\delta)$.
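
To make Definition 1.2 and the role of the positive root concrete, here is a small Python sketch. It extracts $H^*$ from an edge list and solves $P(\theta) = 1+\delta$ by bisection for a polynomial given by its coefficient list. The names `max_degree_part` and `solve_theta` are hypothetical, and bisection is merely one convenient numerical method, chosen because $P$ is increasing on $[0,\infty)$.

```python
def max_degree_part(n, edges):
    """Vertices and edges of H*, the subgraph induced on maximum-degree vertices."""
    degree = [0] * n
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    top = max(degree)
    keep = [v for v in range(n) if degree[v] == top]
    kept = set(keep)
    return keep, [(u, v) for u, v in edges if u in kept and v in kept]

def solve_theta(coeffs, delta, iterations=200):
    """Bisection for the unique theta > 0 with P(theta) = 1 + delta.

    `coeffs` lists the coefficients of P by increasing degree; we assume
    coeffs[0] == 1 and coeffs[1] >= 1, so P is increasing and unbounded.
    """
    def P(x):
        return sum(c * x**k for k, c in enumerate(coeffs))

    lo, hi = 0.0, 1.0
    while P(hi) < 1.0 + delta:      # grow the bracket until it contains the root
        hi *= 2.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if P(mid) < 1.0 + delta:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

For the path on three vertices, $H^*$ is the single middle vertex, so $P_{H^*}(x) = 1 + x$ and $\theta = \delta$.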


1.2. Variational problem. For graphs $G$ and $H$, denote by $\hom(H,G)$ the number of homomorphisms
from $H$ to $G$ (a graph homomorphism is a map $V(H) \to V(G)$ that carries every edge of $H$ to an edge
of $G$). The homomorphism density of $H$ in $G$ is defined as $t(H,G) := \hom(H,G)\,|V(G)|^{-|V(H)|}$, that
is, the probability that a uniformly random map $V(H) \to V(G)$ is a homomorphism from $H$ to $G$.

Henceforth, we will work with $t(H,G)$ for $G \sim \mathcal{G}_{n,p}$ instead of $X_H$ for convenience (the two quantities
are nearly proportional, as the only possible discrepancies, namely non-injective homomorphisms from $H$ to
$G$, are a negligible fraction of all homomorphisms when $G$ is sufficiently large and not too sparse).
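
The homomorphism density can be evaluated directly from its definition on small examples. The sketch below is a brute-force illustration: it sums over all maps $V(H) \to V(G)$, with $G$ given by a symmetric weight matrix. The function name `hom_density` and the matrix encoding are assumptions made here; a 0/1 adjacency matrix recovers $\hom(H,G)\,|V(G)|^{-|V(H)|}$.

```python
from itertools import product

def hom_density(k, h_edges, weights):
    """t(H, G) for H on vertices 0..k-1 and G given by a symmetric weight matrix."""
    n = len(weights)
    total = 0.0
    for image in product(range(n), repeat=k):   # every map V(H) -> V(G)
        term = 1.0
        for x, y in h_edges:
            term *= weights[image[x]][image[y]]
            if term == 0.0:                     # this map is not a homomorphism
                break
        total += term
    return total / n**k
```

The cost is $|V(G)|^{|V(H)|}$ evaluations, so this is meant for sanity checks only.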

Chatterjee and Dembo [7] proved a non-linear large deviation principle, and in particular derived
the exact asymptotics of the rate function for a general graph $H$ in terms of a variational problem.


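Definition 1.3 (Discrete variational problem). Let $\mathscr{G}_n$ denote the set of weighted undirected graphs on
$n$ vertices with edge weights in $[0,1]$, that is, if $A(G)$ is the adjacency matrix of $G$ then

\[ \mathscr{G}_n = \big\{ G_n : A(G_n) = (a_{ij})_{1\le i,j\le n},\ 0 \le a_{ij} \le 1,\ a_{ij} = a_{ji},\ a_{ii} = 0 \text{ for all } i,j \big\}. \]

Let $H$ be a fixed graph with maximum degree $\Delta$. The variational problem for $\delta > 0$ and $0 < p < 1$ is

\[ \phi(H,n,p,\delta) := \inf\big\{ I_p(G_n) : G_n \in \mathscr{G}_n \text{ with } t(H,G_n) \ge (1+\delta)\,p^{|E(H)|} \big\}, \tag{1.4} \]

where

\[ t(H,G_n) := n^{-|V(H)|} \sum_{1\le i_1,\ldots,i_k\le n}\ \prod_{(x,y)\in E(H)} a_{i_x i_y} \]

(with $k = |V(H)|$) is the density of (labeled) copies of $H$ in $G_n$, and $I_p(G_n)$ is the entropy relative to $p$, that is,

\[ I_p(G_n) := \sum_{1\le i<j\le n} I_p(a_{ij}) \quad\text{and}\quad I_p(x) := x\log\frac{x}{p} + (1-x)\log\frac{1-x}{1-p}. \]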


Theorem 1.4 (Chatterjee and Dembo [7]). Let $H$ be a fixed graph. There is some explicit $\alpha_H > 0$
such that for $n^{-\alpha_H} \le p < 1$ and any fixed $\delta > 0$,

\[ \mathbb{P}\big(t(H,\mathcal{G}_{n,p}) \ge (1+\delta)\,p^{|E(H)|}\big) = \exp\big(-(1+o(1))\,\phi(H,n,p,\delta)\big), \]

where $\phi(H,n,p,\delta)$ is as defined in (1.4) and the $o(1)$-term goes to zero as $n \to \infty$.


Thanks to this theorem, solving the variational problem $\phi(H,n,p,\delta)$ asymptotically would give the
asymptotic rate function for $H$ when $n^{-\alpha_H} \le p = o(1)$ (as was done for $H = K_k$ in [23], yielding (1.2)).
Figure 2. Solution candidates for discrete variational problem (clique and anti-clique).

1.3. Main Result. Let $H$ be a graph with maximum degree $\Delta = \Delta(H)$; recall that $H$ is regular (or
$\Delta$-regular) if all its vertices have degree $\Delta$, and irregular otherwise. Starting with a weighted graph
$G_n$ with all edge-weights $a_{ij}$ equal to $p$, we consider the following two ways of modifying it so that it
satisfies the constraint $t(H,G_n) \ge (1+\delta)p^{|E(H)|}$ of the variational problem (1.4) (see Figure 2).

(a) (Planting a clique) Set $a_{ij} = 1$ for all $1 \le i,j \le s$ for $s \sim \delta^{1/|V(H)|}p^{\Delta/2}n$. This construction is
effective only when $H$ is $\Delta$-regular, in which case it gives $t(H,G_n) \sim (1+\delta)p^{|E(H)|}$.

(b) (Planting an anti-clique) Set $a_{ij} = 1$ whenever $i \le s$ or $j \le s$ for $s \sim \theta p^\Delta n$ for $\theta = \theta(H,\delta) > 0$
such that $P_{H^*}(\theta) = 1+\delta$, in which case $t(H,G_n) \sim (1+\delta)p^{|E(H)|}$.

We postpone the short calculation that in each case $t(H,G_n) \sim (1+\delta)p^{|E(H)|}$ to §2. Our main result
(Theorem 1.5 below) says that, for a connected graph $H$ and $n^{-1/\Delta} \ll p \ll 1$, one of these constructions
has $I_p(G_n)$ that is within a $(1+o(1))$-factor of the optimum achieved by the variational problem (1.4).
For example, when $H = K_3$, the clique construction has $I_p(G_n) \sim \frac12s^2I_p(1) \sim \frac12\delta^{2/3}n^2p^2\log(1/p)$, while
$P_{K_3}(x) = 1+3x$ so $\theta = \delta/3$ and the anti-clique construction has $I_p(G_n) \sim snI_p(1) \sim \frac13\delta n^2p^2\log(1/p)$
(thus the clique is more advantageous iff $\delta > 27/8$), exactly the bounds that were featured in (1.1).
The following result extends [23, Theorems 1.1 and 4.1] from cliques to the case of a general graph $H$.

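Theorem 1.5. Let $H$ be a fixed connected graph with maximum degree $\Delta \ge 2$. For any fixed $\delta > 0$
and $n^{-1/\Delta} \ll p = o(1)$, the solution to the discrete variational problem (1.4) satisfies

\[ \lim_{n\to\infty}\frac{\phi(H,n,p,\delta)}{n^2p^\Delta\log(1/p)} = \begin{cases} \min\{\theta,\ \tfrac12\delta^{2/|V(H)|}\} & \text{if $H$ is regular},\\ \theta & \text{if $H$ is irregular}, \end{cases} \]

where $\theta = \theta(H,\delta)$ is the unique positive solution to $P_{H^*}(\theta) = 1+\delta$ (for $P_{H^*}(x)$ as per Defs. 1.1-1.2).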

When combined with Theorem 1.4, this yields the following conclusion for the upper tail problem. 

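Corollary 1.6. Let $H$ be a fixed connected graph with maximum degree $\Delta \ge 2$. There exists $\alpha_H > 0$
such that for $n^{-\alpha_H} \le p \ll 1$ the following holds. For any fixed $\delta > 0$,

\[ \lim_{n\to\infty}\frac{-\log\mathbb{P}\big(t(H,\mathcal{G}_{n,p}) \ge (1+\delta)\,p^{|E(H)|}\big)}{n^2p^\Delta\log(1/p)} = \begin{cases} \min\{\theta,\ \tfrac12\delta^{2/|V(H)|}\} & \text{if $H$ is regular},\\ \theta & \text{if $H$ is irregular}, \end{cases} \]

where $\theta = \theta(H,\delta)$ is the unique positive solution to $P_{H^*}(\theta) = 1+\delta$ (for $P_{H^*}(x)$ as per Defs. 1.1-1.2).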
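Observe that when $H$ is regular, there exists a unique $\delta_0 = \delta_0(H) > 0$ such that

\[ \theta(H,\delta) \le \tfrac12\delta^{2/|V(H)|} \quad\text{iff}\quad \delta \le \delta_0(H). \tag{1.5} \]

That is, the leading order constant of $\phi(H,n,p,\delta)$ (giving the asymptotic upper tail) is governed by
the anti-clique for $\delta < \delta_0$ and by the clique for $\delta \ge \delta_0$ (the above example of $H = K_3$ had $\delta_0 = 27/8$).
Indeed, let $k = |V(H)| \ge 3$ and recall that $P_H(x)$ is an increasing (as it has nonnegative coefficients)
bijection of $[0,\infty)$ onto $[1,\infty)$, whence (by definition) $\theta \le \frac12\delta^{2/k}$ iff $1+\delta = P_H(\theta) \le P_H(\frac12\delta^{2/k})$. Now,
the function $f(\delta) = [P_H(\frac12\delta^{2/k}) - 1]/\delta$ is decreasing in $[0,\infty)$ (every independent set of a regular graph has
at most $k/2$ vertices, so each term of $f$ is a nonpositive power of $\delta$) and satisfies $\lim_{\delta\to\infty} f(\delta) < 1$
and $\lim_{\delta\to0} f(\delta) = \infty$, with a unique $\delta_0$ in between at which $f(\delta_0) = 1$, and the observation follows.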
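
The crossover point $\delta_0(H)$ can be located numerically from the monotonicity of $f$. The following is a minimal sketch under our own conventions: `coeffs` is the coefficient list of $P_H$ for a regular $H$ (so `coeffs[1]` equals $|V(H)|$), and `delta_zero` is a hypothetical helper that bisects on $f(\delta) = 1$.

```python
def delta_zero(coeffs, iterations=200):
    """Crossover delta_0 for a regular graph H whose independence polynomial is `coeffs`.

    Solves f(delta) = [P_H(delta**(2/k) / 2) - 1] / delta = 1 with k = coeffs[1] = |V(H)|.
    """
    k = coeffs[1]

    def f(delta):
        x = 0.5 * delta ** (2.0 / k)
        return (sum(c * x**j for j, c in enumerate(coeffs)) - 1.0) / delta

    lo, hi = 1e-9, 1.0
    while f(hi) > 1.0:          # f decreases to a limit below 1, so this terminates
        hi *= 2.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if f(mid) > 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

With the coefficient lists `[1, 3]` for $K_3$ and `[1, 4, 2]` for $C_4$, this reproduces the thresholds $27/8$ and $16$ mentioned in the text.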

We remark that our results extend (see Corollary 7.3) to any disconnected graph H. The interplay 
between different connected components can then cause the upper tail to be dominated, not by an 
exclusive appearance of either the clique or the anti-clique constructions (as was the case for any 
connected graph H, cf. Theorem 1.5), but rather by an interpolation of these; see §7.2 for more details. 

The assumption $p \gg n^{-1/\Delta}$ in Theorem 1.5 is essentially tight in the sense that the upper tail rate
function undergoes a phase transition at that location [15]: it is of order $n^{2+o(1)}p^\Delta$ for $p \ge n^{-1/\Delta}$, and
below that threshold it becomes a function (denoted $M^*_H(n,p)$ in [15]) depending on all subgraphs of $H$.
In terms of the discrete variational problem (1.4), again this threshold marks a phase transition, as the
anti-clique construction ceases to be viable for $p \ll n^{-1/\Delta}$ (recall that $s \sim \theta p^\Delta n$ in that construction).

Still, as in [23, Theorems 1.1 and 4.1], our methods show that if $H$ is regular and $n^{-2/\Delta} \ll p \ll n^{-1/\Delta}$,
the solution to the variational problem is $(1+o(1))\,\frac12\delta^{2/|V(H)|}\,n^2p^\Delta\log(1/p)$ (i.e., governed by the clique construction).

Finally, several of the tools that were developed here to overcome the obstacles in extending the 
analysis of [23] to general graphs (arising already for the 4-cycle) may be of independent interest and 
find other applications, e.g., the crucial use of adaptively chosen degree-thresholds (see §4 for details). 


1.4. Examples. We now demonstrate the solution of the variational problem (1.4), as provided by
Theorem 1.5, for various families of graphs (adding to the previously known [23] case of cliques, cf. (1.2)).

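Example 1.7 ($k$-cycle: $H = C_k$). It is easy to verify that $P_{C_k}(x)$ satisfies the recursion^3

\[ P_{C_k}(x) = P_{C_{k-1}}(x) + xP_{C_{k-2}}(x), \qquad P_{C_2}(x) = 2x+1, \quad P_{C_3}(x) = 3x+1. \]

For instance, $P_{C_4}(x) = 2x^2+4x+1$ and $P_{C_5}(x) = 5x^2+5x+1$; by Theorem 1.5, if $n^{-1/2} \ll p \ll 1$,

\[ \phi(C_4,n,p,\delta) \sim \min\{\theta(C_4,\delta),\ \tfrac12\delta^{1/2}\}\,n^2p^2\log(1/p) \quad\text{for}\quad \theta(C_4,\delta) = -1+\sqrt{1+\tfrac12\delta}, \]

\[ \phi(C_5,n,p,\delta) \sim \min\{\theta(C_5,\delta),\ \tfrac12\delta^{2/5}\}\,n^2p^2\log(1/p) \quad\text{for}\quad \theta(C_5,\delta) = -\tfrac12+\sqrt{\tfrac14+\tfrac15\delta}. \]

For general $k$, with square brackets denoting extraction of coefficients, $[x]P_{C_k}(x) = k$ (more generally,
$[x]P_H(x) = |V(H)|$ for any $H$), while the closely related recursion for Chebyshev's polynomials yields

\[ P_{C_k}(x) = \sum_{l=0}^{\lfloor k/2\rfloor} \frac{k}{k-l}\binom{k-l}{l}\,x^l, \tag{1.6} \]

and so $[x^2]P_{C_k}(x) = \frac12k(k-3)$; e.g., for any $k \ge 4$, the behavior of $\theta(C_k,\delta)$ for small $\delta$ (see Fig. 1) is

\[ \theta(C_k,\delta) = \frac{1}{k-3}\Big(-1+\sqrt{1+2\delta(k-3)/k}\Big) + O(\delta^3) = \frac1k\delta - \frac{k-3}{2k^2}\delta^2 + O(\delta^3). \]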
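
The recursion and the closed form (1.6) for cycles can be cross-checked mechanically. The sketch below is an illustration with hypothetical helper names: it builds $P_{C_k}$ both ways as coefficient lists (lowest degree first) and evaluates the root of the quadratic truncation $1 + kx + \frac12k(k-3)x^2 = 1+\delta$, which is exact for $k = 4$.

```python
from math import comb, sqrt

def cycle_poly_recursive(k):
    """P_{C_k} via P_{C_k} = P_{C_{k-1}} + x * P_{C_{k-2}}, for k >= 2."""
    older, newer = [1, 2], [1, 3]          # P_{C_2} and P_{C_3} as given in the text
    if k == 2:
        return older
    for _ in range(k - 3):
        shifted = [0] + older              # multiplication by x
        size = max(len(newer), len(shifted))
        nxt = [(newer[i] if i < len(newer) else 0) +
               (shifted[i] if i < len(shifted) else 0) for i in range(size)]
        older, newer = newer, nxt
    return newer

def cycle_poly_closed(k):
    """P_{C_k} via the closed form (1.6); the quotient is always an integer."""
    return [k * comb(k - l, l) // (k - l) for l in range(k // 2 + 1)]

def theta_cycle_quadratic(k, delta):
    """Positive root of 1 + k x + k(k-3)/2 x^2 = 1 + delta, for k >= 4."""
    return (-1.0 + sqrt(1.0 + 2.0 * delta * (k - 3) / k)) / (k - 3)
```

For $k \ge 5$ the quadratic root differs from $\theta(C_k,\delta)$ by the $O(\delta^3)$ term in the expansion.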


^3 By the definition of the independence polynomial, for any graph $H$ and vertex $v$ in it, $P_H(x) = P_{H_1}(x) + xP_{H_2}(x)$, where
$H_1$ is obtained from $H$ by deleting $v$ and $H_2$ is obtained from $H$ by deleting $v$ and all its neighbors.
Finally, observe that for even $k$ we can write (1.6) as $P_{C_k}(x) = [\frac12(\sqrt{1+4x}+1)]^k + [\frac12(\sqrt{1+4x}-1)]^k$,
and deduce that the value of $P_{C_k}(\frac12\delta^{2/k})$ for $\delta = 2^k$ is simply $P_{C_k}(2) = 2^k+1 = 1+\delta$. Thus, by the
remark following Corollary 1.6, the transition addressed in (1.5) occurs at $\delta_0(C_k) = 2^k$ for even $k$; e.g.,

\[ \lim_{n\to\infty}\frac{\phi(C_4,n,p,\delta)}{n^2p^2\log(1/p)} = \begin{cases} -1+\sqrt{1+\tfrac12\delta} & \text{if } \delta < 16,\\ \tfrac12\sqrt{\delta} & \text{if } \delta \ge 16. \end{cases} \]

As mentioned above, $H = C_4$ is the simplest graph for which the arguments in [23] did not give sharp
bounds on $\phi(H,n,p,\delta)$, and its treatment is instrumental for the analysis of general graphs (see §4.2).


Example 1.8 (binary tree). Letting $T_h$ denote the complete binary tree of height $h$ ($|V(T_h)| = 2^h-1$),
observe that, by counting independent sets excluding/including the root, $P_{T_h}(x)$ satisfies the recursion

\[ P_{T_h}(x) = P_{T_{h-1}}(x)^2 + xP_{T_{h-2}}(x)^4, \qquad P_{T_0}(x) = 1, \quad P_{T_1}(x) = x+1. \]

The polynomial $P_{T_h^*}(x)$ restricts us to independent sets of $T_h$ where all degrees are $\Delta$, and therefore

\[ P_{T_h^*}(x) = P_{T_{h-2}}(x)^2, \]

as the restriction excludes precisely the root and leaves. (More generally, for the $b$-ary tree ($b \ge 2$) one
has $P_{T_h}(x) = P_{T_{h-1}}(x)^b + xP_{T_{h-2}}(x)^{b^2}$ and $P_{T_h^*}(x) = P_{T_{h-2}}(x)^b$.)

For instance, $T_4$, the binary tree on 15 vertices, has

\[ P_{T_4^*}(x) = P_{T_2}(x)^2 = x^4+6x^3+11x^2+6x+1, \]

and solving $P_{T_4^*}(\theta) = 1+\delta$ we find that, as per Theorem 1.5, if $n^{-1/3} \ll p \ll 1$ then

\[ \lim_{n\to\infty}\frac{\phi(T_4,n,p,\delta)}{n^2p^3\log(1/p)} = -\frac32 + \frac12\sqrt{5+4\sqrt{1+\delta}}. \tag{1.8} \]
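
The binary-tree recursion is easy to run on coefficient lists. The sketch below is an illustration only: the helpers `poly_mul`, `poly_pow`, `tree_poly` and `theta_t4` are names introduced here. It builds $P_{T_h}$ from the root-excluded and root-included cases and evaluates the closed form in (1.8).

```python
from math import sqrt

def poly_mul(a, b):
    """Product of two polynomials given as coefficient lists (lowest degree first)."""
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

def poly_pow(a, e):
    """a**e by repeated multiplication."""
    out = [1]
    for _ in range(e):
        out = poly_mul(out, a)
    return out

def tree_poly(h):
    """P_{T_h} for the complete binary tree of height h."""
    older, newer = [1], [1, 1]                   # P_{T_0} and P_{T_1}
    if h == 0:
        return older
    for _ in range(h - 1):
        with_root = [0] + poly_pow(older, 4)     # x * P_{T_{h-2}}^4
        without_root = poly_pow(newer, 2)        # P_{T_{h-1}}^2
        size = max(len(with_root), len(without_root))
        nxt = [(without_root[i] if i < len(without_root) else 0) +
               (with_root[i] if i < len(with_root) else 0) for i in range(size)]
        older, newer = newer, nxt
    return newer

def theta_t4(delta):
    """Closed form (1.8): positive root of P_{T_2}(x)^2 = 1 + delta."""
    return -1.5 + 0.5 * sqrt(5.0 + 4.0 * sqrt(1.0 + delta))
```

Squaring `tree_poly(2)` with `poly_pow` gives the coefficients of $P_{T_4^*}$ quoted above, and the linear coefficient of `tree_poly(4)` counts the 15 vertices of $T_4$.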


For general $h$, we can for instance deduce from the recurrence above (and the facts $[x]P_H(x) = |V(H)|$
and $[x]P_{H^*}(x) = \#\{v : \deg(v) = \Delta\}$) that $[x]P_{T_h^*}(x) = 2^{h-1}-2$ and $[x^2]P_{T_h^*}(x) = 2^{2h-3} - 7\cdot2^{h-2} + 7$
for any $h \ge 3$, using which it is easy to write $\theta(T_h,\delta)$ explicitly up to an additive $O(\delta^3)$-term.


Example 1.9 (complete bipartite: $H = K_{k,\ell}$ for $k \ge \ell$). In case $k > \ell$ we have $P_{K^*_{k,\ell}}(x) = (1+x)^\ell$ as we
only count independent sets in the $k$-regular side (of size $\ell$); thus, by Theorem 1.5, for $n^{-1/k} \ll p \ll 1$,

\[ \lim_{n\to\infty}\frac{\phi(K_{k,\ell},n,p,\delta)}{n^2p^k\log(1/p)} = (1+\delta)^{1/\ell} - 1. \tag{1.9} \]

If $k = \ell$, the coefficients of $x^j$ ($j \ge 1$) are doubled, so $P_{K_{k,k}}(x) = 2(1+x)^k - 1$ and for $n^{-1/k} \ll p \ll 1$,

\[ \lim_{n\to\infty}\frac{\phi(K_{k,k},n,p,\delta)}{n^2p^k\log(1/p)} = \min\big\{(1+\tfrac12\delta)^{1/k} - 1,\ \tfrac12\delta^{1/k}\big\}. \tag{1.10} \]
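
The bipartite formulas can be checked against a brute-force count. In the sketch below (our own illustrative code, with hypothetical names), `bipartite_indep_poly` enumerates independent sets of $K_{k,\ell}$ directly from its edge set, and the two remaining helpers evaluate the right-hand sides of (1.9) and (1.10).

```python
from itertools import combinations

def bipartite_indep_poly(k, l):
    """Brute-force independence polynomial coefficients of K_{k,l}."""
    n = k + l
    # Vertices 0..k-1 form one side, k..n-1 the other; every cross pair is an edge.
    edge_set = {frozenset((u, v)) for u in range(k) for v in range(k, n)}
    coeffs = []
    for size in range(n + 1):
        coeffs.append(sum(
            1 for s in combinations(range(n), size)
            if all(frozenset(q) not in edge_set for q in combinations(s, 2))
        ))
    while len(coeffs) > 1 and coeffs[-1] == 0:
        coeffs.pop()
    return coeffs

def constant_unbalanced(l, delta):
    """Right-hand side of (1.9), for k > l."""
    return (1.0 + delta) ** (1.0 / l) - 1.0

def constant_balanced(k, delta):
    """Right-hand side of (1.10), for k = l."""
    return min((1.0 + delta / 2.0) ** (1.0 / k) - 1.0, 0.5 * delta ** (1.0 / k))
```

For $k = \ell = 2$ the graph is $C_4$, so `constant_balanced(2, delta)` agrees with the 4-cycle constant from Example 1.7.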


1.5. Organization. In §2 we describe the graphon formulation of the variational problem. Section 3
contains preliminary estimates needed from [23]. Section 4 begins with a short proof for the known
case of $K_3$, continues by illustrating some of the new techniques with the case of $C_4$, then outlines
the approach for general graphs. Theorem 2.3 and Theorem 2.2 are proved in §5 and §6, respectively.
Finally, §7 provides the details required to pass from the continuous to the discrete variational problems.
2. The graphon formulation of the variational problem


Following [23], we will analyze a continuous version of the discrete variational problem (1.4) that has
the advantage of having no dependence on $n$. Recall that a graphon is a symmetric measurable function
$W : [0,1]^2 \to [0,1]$ (where symmetric means $W(x,y) = W(y,x)$); in the continuous version of (1.4), $W$
will replace the edge-weighted graph $G_n$, as the latter can be viewed as a discrete approximation of a
graphon (see, e.g., [2,3,19,21] for more on graph limits). As we will later see in §7 (see Lemma 7.1),
the solution to this variational problem will immediately imply a lower bound on the problem (1.4).
In what follows and throughout the paper, write $\mathbb{E}[f(W)] := \int_{[0,1]^2} f(W(x,y))\,dxdy$.

Definition 2.1 (Graphon variational problem). For $\delta > 0$ and $0 < p < 1$, let

\[ \phi(H,p,\delta) := \inf\big\{ \tfrac12\mathbb{E}[I_p(W)] : \text{graphon } W \text{ with } t(H,W) \ge (1+\delta)\,p^{|E(H)|} \big\}, \tag{2.1} \]

where

\[ t(H,W) := \int_{[0,1]^{|V(H)|}} \prod_{(i,j)\in E(H)} W(x_i,x_j)\,dx_1dx_2\cdots dx_{|V(H)|}. \]

For example, for $H = K_3$ we wish to minimize $\mathbb{E}[I_p(W)] := \int_{[0,1]^2} I_p(W(x,y))\,dxdy$ over all graphons
$W$ whose triangle density $t(K_3,W) = \int_{[0,1]^3} W(x,y)W(x,z)W(y,z)\,dxdydz$ is at least $(1+\delta)p^3$.
The solution of the graphon variational problem is given by the following two theorems. 


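Theorem 2.2. Let $H$ be a connected $\Delta$-regular graph. Fix $\delta > 0$ and let $\theta = \theta(H,\delta)$ be the unique
positive solution to $P_H(\theta) = 1+\delta$ (for $P_H(x)$ as defined in (1.3)). Then

\[ \lim_{p\to0}\frac{\phi(H,p,\delta)}{p^\Delta\log(1/p)} = \min\big\{\theta,\ \tfrac12\delta^{2/|V(H)|}\big\}. \]

Theorem 2.3. Let $H$ be a connected irregular graph with maximum degree $\Delta$. Fix $\delta > 0$ and let
$\theta = \theta(H,\delta)$ be the unique positive solution to $P_{H^*}(\theta) = 1+\delta$ (for $P_{H^*}(x)$ as per Defs. 1.1-1.2). Then

\[ \lim_{p\to0}\frac{\phi(H,p,\delta)}{p^\Delta\log(1/p)} = \theta. \]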

The proofs of the lower bounds are given in §5 and §6. The upper bounds for both theorems 
follow from the graphon analogs of the clique and anti-clique constructions in §1.3 (nearly identical 
computations yield the analogous upper bounds for the discrete variational problem (1.4)), as we 
establish in the following proposition. 


Proposition 2.4. For any fixed graph $H$ with maximum degree $\Delta$ and any $\delta > 0$,

\[ \phi(H,p,\delta) \le (\theta+o(1))\,p^\Delta\log(1/p), \tag{2.2} \]

for $\theta = \theta(H,\delta)$ the unique positive solution to $P_{H^*}(\theta) = 1+\delta$. Moreover, if $H$ is regular and connected,

\[ \phi(H,p,\delta) \le \big(\tfrac12\delta^{2/|V(H)|}+o(1)\big)\,p^\Delta\log(1/p), \tag{2.3} \]

where in both cases the $o(1)$-terms vanish as $p \to 0$.

Throughout the paper, if $H$ is a graph and $v \in V(H)$, let $N_H(v) = \{u : (u,v) \in E(H)\}$ be the
neighborhood of $v$ in $H$. For $S \subseteq V(H)$, write $N_H(S) = \bigcup_{v\in S}N_H(v)$ for the neighborhood of $S$, let $H_S$
denote the induced subgraph of $H$ on $S$, and let $\bar S = V(H)\setminus S$.
(a) The clique graphon (b) The anti-clique graphon

Figure 3. Solution candidates for the graphon variational problem ($a \asymp p^{\Delta/2}$ and $b \asymp p^\Delta$).


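Proof of Proposition 2.4. Define the clique graphon $W_1$ and anti-clique graphon $W_2$ by

\[ W_1 = p + (1-p)\mathbf{1}_{[0,a]^2} \quad\text{for } a = \delta^{1/|V(H)|}p^{\Delta/2}, \tag{2.4} \]

\[ W_2 = 1 - (1-p)\mathbf{1}_{[b,1]^2} \quad\text{for } b = \theta p^\Delta. \tag{2.5} \]

(See Figure 3 for an illustration of these graphons.) Observe that

\[ \mathbb{E}[I_p(W_1)] = a^2\log(1/p), \qquad \mathbb{E}[I_p(W_2)] = (2b-b^2)\log(1/p); \]

thus, the desired statements (2.2)-(2.3) would follow once we show that

\[ t(H,W_1) \sim (1+\delta)\,p^{|E(H)|} \quad\text{for any regular connected graph } H, \tag{2.6} \]

\[ t(H,W_2) \sim (1+\delta)\,p^{|E(H)|} \quad\text{for any graph } H. \tag{2.7} \]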


For (2.6), we sum over the subset $S \subseteq V(H)$ of vertices that get mapped to the interval $[0,a]$ and get

\[ t(H,W_1) = \sum_{S\subseteq V(H)} a^{|S|}(1-a)^{|V(H)|-|S|}\,p^{|E(H)|-|E(H_S)|}. \]

Recalling $a \asymp p^{\Delta/2}$, the exponent of $p$ in each summand is $|E(H)| + |S|\Delta/2 - |E(H_S)|$, hence the
non-negligible terms (of order $p^{|E(H)|}$) are precisely those with $2|E(H_S)| = |S|\Delta$. This occurs for
$S = \emptyset$, contributing $(1+o(1))p^{|E(H)|}$, and as $H$ is regular and connected, the only other possibility is
$S = V(H)$, contributing $\sim \delta p^{|E(H)|}$. This establishes (2.6).

For (2.7), by summing over the subset $S \subseteq V(H)$ of vertices that get mapped to the interval $[0,b]$,
we obtain

\[ t(H,W_2) = \sum_{S\subseteq V(H)} b^{|S|}(1-b)^{|V(H)|-|S|}\,p^{|E(H_{\bar S})|}. \]

Since $b \asymp p^\Delta$, ignoring summands that are $o(p^{|E(H)|})$ leaves only those with $|S|\Delta + |E(H_{\bar S})| \le |E(H)|$.
This holds (with an equality) iff $S$ is an independent set where every vertex has degree $\Delta$; thus,

\[ t(H,W_2) \sim P_{H^*}(\theta)\,p^{|E(H)|} = (1+\delta)\,p^{|E(H)|}. \qquad\square \]
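
The subset expansion for the anti-clique graphon can be evaluated numerically to watch the limit emerge. The sketch below is illustrative code of ours with a hypothetical function name: it sums $b^{|S|}(1-b)^{|V(H)|-|S|}p^{e}$ over all $S \subseteq V(H)$, where $e$ counts the edges of $H$ with both endpoints outside $S$, and the result can be compared with $(1+\delta)p^{|E(H)|}$ for $b = \theta p^\Delta$.

```python
from itertools import combinations

def anticlique_density(n, edges, b, p):
    """Subset expansion of t(H, W_2) for H on vertices 0..n-1."""
    total = 0.0
    for size in range(n + 1):
        for subset in combinations(range(n), size):
            s = set(subset)
            # Edges with both endpoints outside S keep weight p; all others have weight 1.
            outside = sum(1 for u, v in edges if u not in s and v not in s)
            total += b**size * (1.0 - b)**(n - size) * p**outside
    return total
```

For the path on three vertices ($\Delta = 2$, $\theta = \delta$) and for $C_4$ ($\theta = -1+\sqrt{1+\delta/2}$), the ratio to $p^{|E(H)|}$ approaches $1+\delta$ as $p$ shrinks, with an error of order $p$.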
3. Preliminaries

In this section, we recall various relevant estimates from [23], used there to solve the continuous 
variational problem (2.1) for the case of cliques. A key inequality used both in [23] and in its prequel 
dealing with dense graphs [22] is the following generalization of Hölder's inequality ([13, Theorem 2.1]).

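Lemma 3.1. Let $\mu_1,\mu_2,\ldots,\mu_n$ be probability measures on $\Omega_1,\ldots,\Omega_n$, resp., and let $\mu = \bigotimes_{i=1}^n\mu_i$. Let
$A_1,\ldots,A_m$ be nonempty subsets of $[n] = \{1,\ldots,n\}$ and for $A \subseteq [n]$ put $\mu_A = \bigotimes_{j\in A}\mu_j$ and $\Omega_A = \prod_{j\in A}\Omega_j$.
Let $f_i \in L^{p_i}(\Omega_{A_i},\mu_{A_i})$ for each $i \in [m]$, and further suppose that $\sum_{i:\,j\in A_i} 1/p_i \le 1$ for all $j \in [n]$. Then

\[ \int \prod_{i=1}^m |f_i|\,d\mu \le \prod_{i=1}^m \Big(\int |f_i|^{p_i}\,d\mu_{A_i}\Big)^{1/p_i}. \tag{3.1} \]

In particular, when $p_i = \Delta$ for every $i \in [m]$ we have $\int\prod_{i=1}^m |f_i|\,d\mu \le \prod_{i=1}^m \big(\int |f_i|^\Delta\,d\mu_{A_i}\big)^{1/\Delta}$.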

Let $H$ be any graph with maximum degree $\Delta$, and let $W$ be a graphon with $t(H,W) \ge (1+\delta)p^{|E(H)|}$.
Since $I_p$ is convex and decreasing from 0 to $p$ and increasing from $p$ to 1, we may assume $W \ge p$, i.e.,

\[ U := W - p \ \text{ satisfies }\ 0 \le U \le 1-p, \qquad t(H,p+U) \ge (1+\delta)\,p^{|E(H)|}. \tag{3.2} \]

For $b \in (0,1]$, define the set $B_b(U)$ of points $x$ with high normalized degree $d_U(x)$ by

\[ B_b(U) := \{x : d_U(x) \ge b\}, \quad\text{where}\quad d_U(x) := \int_0^1 U(x,y)\,dy. \tag{3.3} \]

Hereafter, the dependence on $U$ will be dropped from $B_b(U)$ and $d_U(x)$, whenever the graphon $U$ is
clear from the context. Letting $\lambda$ denote Lebesgue measure, by the convexity of $I_p(\cdot)$, for any $b \gg p$,

\[ \mathbb{E}[I_p(p+U)] = \int_{[0,1]^2} I_p(p+U(x,y))\,dxdy \ge \int_0^1 I_p(p+d_U(x))\,dx \ge \lambda(B_b)\,I_p(p+b). \tag{3.4} \]

Moreover, by Proposition 2.4, we may assume that, for some implicit constant depending on $\delta$ and $H$,

\[ \mathbb{E}[I_p(p+U)] \lesssim p^\Delta I_p(1). \tag{3.5} \]

The following estimates for $I_p(x)$ were given in [23]; the $\sim$ notation below is w.r.t. limits as $p \to 0$.

Lemma 3.2 ([23, Lemma 3.3]). If $0 \le x \ll p$, then $I_p(p+x) \sim \frac12x^2/p$, whereas when $p \ll x \le 1-p$
we have $I_p(p+x) \sim x\log(x/p)$.
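
The two regimes of Lemma 3.2 can be observed numerically. The sketch below is an illustration written for this purpose (the function name `relative_entropy` is ours): it evaluates $I_p(x) = x\log(x/p) + (1-x)\log((1-x)/(1-p))$ with the convention $0\log0 = 0$, so that ratios against $\frac12x^2/p$ and $x\log(x/p)$ can be inspected.

```python
from math import log

def relative_entropy(x, p):
    """I_p(x) = x log(x/p) + (1-x) log((1-x)/(1-p)), with 0 log 0 = 0."""
    total = 0.0
    if x > 0.0:
        total += x * log(x / p)
    if x < 1.0:
        total += (1.0 - x) * log((1.0 - x) / (1.0 - p))
    return total

def small_regime_ratio(p, x):
    """I_p(p + x) divided by x^2 / (2p); close to 1 when x is much smaller than p."""
    return relative_entropy(p + x, p) / (x * x / (2.0 * p))

def large_regime_ratio(p, x):
    """I_p(p + x) divided by x log(x/p); tends to 1 when x is much larger than p."""
    return relative_entropy(p + x, p) / (x * log(x / p))
```

In the second regime the convergence is slow: the relative error behaves roughly like $1/\log(x/p)$, so even for $p = 10^{-12}$ and $x = 10^{-3}$ the ratio is only about 0.95.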

Lemma 3.3 ([23, Lemma 3.4]). There exists $p_0 > 0$ such that for every $0 < p \le p_0$,

\[ I_p(p+x) \ge (x/b)^2\,I_p(p+b) \quad\text{for any } 0 \le x \le b \le 1-p-1/\log(1/p). \]

Corollary 3.4 ([23, Corollary 3.5]). There is some $p_0 > 0$ such that for every $0 < p \le p_0$,

\[ I_p(p+x) \ge x^2\,I_p\big(1-1/\log(1/p)\big) \sim x^2I_p(1) \quad\text{for any } 0 \le x \le 1-p. \]

As a consequence of these lemmas, observe that 

x^/^ < Ip(p + x)/Ip(l) + o(p^) for any 0 < X < 1 — p; (3.6) 

indeed, this is trivial for x <C p^^^ due to the o(p^) term; if p^/^ < x < 1 — p then Ip{p + x) > x/p(l) 
by Lemma 3.2; and in between, when p^/^ ^ x < p^^^, we have Ip{p + x) > x^/^p“^/^/p(p + p^'^^) by 
Lemma 3.3 (using b = while Ip{p + p^/^) > p^/^/p(l) again by Lemma 3.2. 
Recalling (3.4), it follows from Lemma 3.2 (combined with (3.5)) that for any $p \ll b \le 1-p$,

\[ \lambda(B_b) \le \frac{\mathbb{E}[I_p(p+U)]}{I_p(p+b)} \lesssim \frac{p^\Delta I_p(1)}{b\log(b/p)} \lesssim \frac{p^\Delta}{b}. \tag{3.7} \]

Furthermore, letting $\bar B_b = [0,1]\setminus B_b$ (here and in what follows), by the convexity of $I_p(x)$ and Lemma 3.3,

\[ \mathbb{E}[I_p(p+U)] \ge \int_{\bar B_b} I_p(p+d(x))\,dx \ge I_p(p+b)\int_{\bar B_b} (d(x)/b)^2\,dx. \]

Combining these, we get

\[ \int_{\bar B_b} d^2(x)\,dx \le \frac{b^2\,\mathbb{E}[I_p(p+U)]}{I_p(p+b)} \lesssim p^\Delta b. \tag{3.8} \]

For a bound on $\mathbb{E}[U]$, recall first from Lemma 3.2 that $I_p\big(p + ap^{(\Delta+1)/2}\sqrt{\log(1/p)}\big) \sim \frac12a^2p^\Delta I_p(1)$ for
any $p \le p_0$ and fixed $a > 0$ and $\Delta \ge 2$. However, $I_p(p+\mathbb{E}[U]) \le \mathbb{E}[I_p(p+U)] \lesssim p^\Delta I_p(1)$ by convexity
and (3.5). Therefore, by the monotonicity of $I_p(p+x)$ for $x \ge 0$ we may conclude that

\[ \mathbb{E}[U] \lesssim p^{(\Delta+1)/2}\sqrt{\log(1/p)}. \tag{3.9} \]

Finally, using (3.5) and Corollary 3.4 we obtain that, for an implicit constant depending on $\delta$ and $H$,

\[ \mathbb{E}[U^2] \lesssim p^\Delta. \tag{3.10} \]

4. Solving the variational problem: from $K_3$ to $C_4$ to general graphs

4.1. Short solution of the variational problem for triangles. The case of $K_3$ (and larger cliques)
was resolved in [23] via a divide-and-conquer approach: roughly put, by setting a certain degree threshold
$b = b(p)$ one finds that, in any graphon whose entropy is of the correct order, the Lebesgue measure
of the set $B_b$ of high degree points (defined in (3.3)) asymptotically determines the surplus of $K_{1,2}$
copies (as in the anti-clique graphon), whereas the points in $\bar B_b$ are left only with the possibility of
contributing extra triangles through “cliques.” We include a (slightly shorter) version of this proof,
towards showing why a more sophisticated cut-off $b(p)$ (tailored to each $U$) is needed for a general $H$.

Theorem 4.1 ([23, Theorem 2.2]). Fix $\delta > 0$. As $p \to 0$, $\phi(K_3,p,\delta) \sim \min\{\frac12\delta^{2/3},\ \frac13\delta\}\,p^2\log(1/p)$.

Proof. Let $W = p+U$ with $W \ge p$ and $t(K_3,W) \ge (1+\delta)p^3$. Expanding $t(K_3,W)$ in terms of $U$,

\[ t(K_3,W) - p^3 = t(K_3,U) + 3p\,t(K_{1,2},U) + 3p^2\,\mathbb{E}[U] \ge \delta p^3. \tag{4.1} \]
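
The equality in (4.1) is a purely algebraic expansion of $\prod(p+U)$ over the three edges of the triangle, and it can be sanity-checked on a discretised graphon. The sketch below is a hedged illustration, not part of the proof: `triangle_expansion_gap` is a name chosen here, and it uses a random symmetric matrix as a step graphon with equal blocks.

```python
import random

def triangle_expansion_gap(n=5, p=0.3, seed=0):
    """Left side minus right side of the identity in (4.1) for a random symmetric U."""
    rng = random.Random(seed)
    U = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            U[i][j] = U[j][i] = rng.uniform(0.0, 1.0 - p)
    W = [[p + U[i][j] for j in range(n)] for i in range(n)]
    idx = range(n)

    def t_triangle(M):
        return sum(M[i][j] * M[i][k] * M[j][k]
                   for i in idx for j in idx for k in idx) / n**3

    def t_cherry(M):
        # Density of K_{1,2}, with the first index playing the role of the center.
        return sum(M[i][j] * M[i][k]
                   for i in idx for j in idx for k in idx) / n**3

    mean_u = sum(U[i][j] for i in idx for j in idx) / n**2
    lhs = t_triangle(W) - p**3
    rhs = t_triangle(U) + 3 * p * t_cherry(U) + 3 * p**2 * mean_u
    return lhs - rhs
```

The returned gap is zero up to floating-point rounding for any symmetric input; the inequality $\ge \delta p^3$ in (4.1) is then just the constraint on $W$.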

Now, $\mathbb{E}[U] \ll p$ by (3.9), and so (4.1) reduces to

\[ t(K_3,U) + 3p\,t(K_{1,2},U) \ge (\delta-o(1))\,p^3. \]

Letting $B_b = \{x : d(x) \ge b\}$ as in (3.3) and recalling (3.7), for any $p \ll b \le 1-p$ we have

\[ \int_{B_b\times B_b} U^2(x,y)\,dxdy \le \lambda(B_b)^2 \lesssim p^4/b^2 \ll p^2. \tag{4.2} \]

Let the degree threshold be some function $b = b(p)$ such that $\sqrt{p\log(1/p)} \ll b \ll 1$, and note that

\[ \int_{[0,1]^3} U(x,y)U(x,z)U(y,z)\,\mathbf{1}\{x \in B_b \text{ or } y \in B_b \text{ or } z \in B_b\}\,dxdydz \le 3\lambda(B_b)\,\mathbb{E}[U] \ll p^3, \tag{4.3} \]
7[o,i]3 
where the last step uses (3.7) and (3.9). Letting $\delta_1$ and $\delta_2$ be defined by

\[ \delta_1 := p^{-2}\int_{B_b\times\bar B_b} U^2(x,y)\,dxdy, \qquad \delta_2 := p^{-2}\int_{\bar B_b\times\bar B_b} U^2(x,y)\,dxdy, \tag{4.4} \]

we deduce from (4.3) and Hölder's inequality (3.1) that

\[ t(K_3,U) = \int_{\bar B_b\times\bar B_b\times\bar B_b} U(x,y)U(y,z)U(x,z)\,dxdydz + o(p^3) \le \big(\delta_2^{3/2}+o(1)\big)\,p^3. \tag{4.5} \]

Similarly, it follows from (3.8) and (4.2) that (not only for $b$ as above but for any $p \ll b \ll 1$),

\[ t(K_{1,2},U) = \int_{B_b\times\bar B_b\times\bar B_b} U(x,y)U(x,z)\,dxdydz + o(p^2) \le (\delta_1+o(1))\,p^2. \tag{4.6} \]

Combining (4.5)-(4.6), the expansion (4.1) reduces to $3\delta_1 + \delta_2^{3/2} \ge \delta - o(1)$, and using Corollary 3.4,

\[ \tfrac12\mathbb{E}[I_p(p+U)] \ge (1+o(1))\,\big(\delta_1+\tfrac12\delta_2\big)\,p^2\log(1/p). \tag{4.7} \]

Therefore,

\[ \liminf_{p\to0}\ \inf\Big\{ \frac{\frac12\mathbb{E}[I_p(p+U)]}{p^2I_p(1)} : t(K_3,p+U) \ge (1+\delta)p^3 \Big\} \ge \min_{\substack{\delta_1,\delta_2\ge0\\ 3\delta_1+\delta_2^{3/2}\ge\delta}} \big\{\delta_1+\tfrac12\delta_2\big\} = \min\big\{\tfrac13\delta,\ \tfrac12\delta^{2/3}\big\}, \]

since the minimum is attained either at $\delta_1 = 0$ or $\delta_2 = 0$. This together with the upper bound in
Proposition 2.4 (recall that $P_{K_3}(x) = 3x+1$) completes the proof for the case of triangles. □


4.2. Solution of the variational problem for the 4-cycle. The argument in §4.1 can be applied
to other graphs $H$ and rule out certain subgraphs $F \subseteq H$ from having a non-negligible contribution
to $t(H,W)$ in the expansion analogous to (4.1). However, as we next see, already for $C_4$ new ideas are
required to tackle all subgraphs $F \subseteq C_4$ and deduce the correct lower bound on $\phi(H,p,\delta)$.

Let $W = p+U$ be such that $W \ge p$ and $t(C_4,W) \ge (1+\delta)p^4$. Expanding $t(C_4,W)$ à la (4.1) yields

\[ t(C_4,U) + 4p^2\,t(K_{1,2},U) + 4p\,t(P_4,U) \ge (\delta-o(1))\,p^4, \tag{4.8} \]

where $P_4$ is the path on 4 vertices and we used that $\mathbb{E}[U] \ll p$ from (3.9) to absorb the remaining terms of
$t(C_4,W) - p^4$ (namely $4p^3\,\mathbb{E}[U]$ and $2p^2\,\mathbb{E}[U]^2$) into the $o(1)$. Let $B_b$ be as in (3.3).
Observe that, by Hölder's inequality (3.1), embeddings $P_4 \mapsto (w,x,y,z) \in [0,1]^4$ with $x \in \bar B_b$ satisfy

\[ \int_{[0,1]\times\bar B_b\times[0,1]^2} U(w,x)U(x,y)U(y,z)\,dwdxdydz \le \int_{\bar B_b\times[0,1]^2} d(x)U(x,y)U(y,z)\,dxdydz \le \Big(\int_{\bar B_b} d^2(x)\,dx\Big)^{1/2}\int_{[0,1]^2} U^2(x,y)\,dxdy \lesssim p^3\sqrt{b} \ll p^3 \tag{4.9} \]

by (3.10) and (3.8), using $b \ll 1$ for the last inequality. Hence, embeddings of $P_4$ with a non-negligible
contribution to $t(P_4,U)$ must place both of the interior (degree-2) vertices in $B_b$. The contribution from
such embeddings is therefore at most $\lambda(B_b)^2 \lesssim p^4/b^2 \ll p^3$ provided $b \gg \sqrt p$; hence, $t(P_4,U) \ll p^3$.

We have already encountered the term $t(K_{1,2},U)$ in §4.1, and it should correspond to copies of $C_4$
arising from anti-cliques, as per (4.6). We therefore move our attention to the different embeddings
of the 4-cycle $t(C_4,U)$, to which end we use $U(w,x,y,z) = U(w,x)U(x,y)U(y,z)U(z,w)$ for brevity.
Clearly, any embedding placing two consecutive vertices of $C_4$ in $B_b$ is negligible since, by (3.7) and (3.9),

\[ \int_{B_b\times B_b\times[0,1]^2} U(w,x,y,z)\,dwdxdydz \le \lambda(B_b)^2\,\mathbb{E}[U] \ll p^4. \tag{4.10} \]
Figure 4. Different embeddings of the 4-cycle: (a) and (b) are non-negligible.


Recalling $\delta_1,\delta_2$ from (4.4), the three other possible embeddings of $C_4$ (see Fig. 4 for an illustration)
are handled via Hölder's inequality as follows.

(a) two nonadjacent vertices in $B_b$ (2 configurations):

\[ \int_{B_b\times\bar B_b\times B_b\times\bar B_b} U(w,x,y,z)\,dwdxdydz \le \Big(\int_{B_b\times\bar B_b} U^2(x,y)\,dxdy\Big)^2 \le \big(\delta_1^2+o(1)\big)\,p^4. \tag{4.11} \]

(b) no vertices in $B_b$ (1 configuration):

\[ \int_{\bar B_b^4} U(w,x,y,z)\,dwdxdydz \le \Big(\int_{\bar B_b\times\bar B_b} U^2(x,y)\,dxdy\Big)^2 \le \big(\delta_2^2+o(1)\big)\,p^4. \tag{4.12} \]

(c) a single vertex in $B_b$ (4 configurations):

\[ \int_{B_b\times\bar B_b^3} U(w,x,y,z)\,dwdxdydz \le (\delta_1\delta_2+o(1))\,p^4. \tag{4.13} \]


Combining (4.10), (4.11)-(4.13) and the estimate (4.6) for $t(K_{1,2},U)$, the expansion (4.8) gives

\[ 2\delta_1^2 + \delta_2^2 + 4\delta_1\delta_2 + 4\delta_1 \ge \delta - o(1), \tag{4.14} \]

valid for any $\sqrt p \ll b \ll 1$. As in the case of $K_3$, we wish to minimize $(\delta_1+\frac12\delta_2)$ subject to this constraint.
Unfortunately, the minimum of $(\delta_1+\frac12\delta_2)$ subject to (4.14) is not attained at $\delta_1 = 0$ or $\delta_2 = 0$, thus
the lower bound obtained in this way does not match the upper bound from Proposition 2.4.
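
To see the effect of the cross term $4\delta_1\delta_2$ concretely, one can minimise $\delta_1+\frac12\delta_2$ along the boundary curve of (4.14), ignoring the $o(1)$. The Python sketch below is our own illustration (the name `best_on_curve` is hypothetical): for each $\delta_1$ on a grid it solves the quadratic for the smallest admissible $\delta_2 \ge 0$.

```python
from math import sqrt

def best_on_curve(delta, steps=2000):
    """Grid-minimise d1 + d2/2 subject to 2 d1^2 + d2^2 + 4 d1 d2 + 4 d1 = delta."""
    d1_max = -1.0 + sqrt(1.0 + delta / 2.0)     # value of d1 on the curve when d2 = 0
    best = None
    for i in range(steps + 1):
        d1 = d1_max * i / steps
        c = 2.0 * d1 * d1 + 4.0 * d1 - delta    # d2 solves d2^2 + 4 d1 d2 + c = 0
        d2 = max(0.0, -2.0 * d1 + sqrt(max(0.0, 4.0 * d1 * d1 - c)))
        value = d1 + 0.5 * d2
        if best is None or value < best[0]:
            best = (value, d1, d2)
    return best
```

In this sketch, $\delta = 100$ gives a minimum of about 4.95 at an interior point near $\delta_1 = 1$, strictly below both endpoint values $\frac12\sqrt{\delta} = 5$ (at $\delta_1 = 0$) and $-1+\sqrt{51} \approx 6.14$ (at $\delta_2 = 0$).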

Recall the clique and anti-clique graphons (2.4)-(2.5). The main contribution of the anti-clique to
$t(C_4,U)$ is through embeddings of type (a), whereas for the clique it is through embeddings of type (b).
For the correct lower bound, we must show that the contribution from embeddings of type (c) is
negligible; however, this can no longer be achieved using any arbitrary $\sqrt p \ll b \ll 1$. To conclude the
proof, we select the degree threshold $b$ adaptively based on the graphon $U$.


Lemma 4.2 (Adaptive degree threshold for $C_4$). Fix $\varepsilon > 0$. There exists $b$ with $\sqrt p \ll b \ll 1$ such that

\[ \int_{B_b\times\bar B_b^3} U(w,x,y,z)\,dwdxdydz \le \varepsilon p^4. \tag{4.15} \]

Proof. Fix $C > 0$ such that $\mathbb{E}[U^2]^2 \le Cp^4$, as guaranteed by (3.10), and set $M = \lceil 2C/\varepsilon\rceil$. Further let
$\sqrt p \ll b_1 \ll b_2 \ll \cdots \ll b_M \ll 1$. As usual, set $B_{b_i} = \{x : d(x) \ge b_i\}$, observing that $B_{b_M} \subseteq \cdots \subseteq B_{b_1}$.
For every $2 \le i \le M$, using (3.7) and (3.8) we find that

\[ \int_{B_{b_i}\times\bar B_{b_i}\times\bar B_{b_{i-1}}\times\bar B_{b_i}} U(w,x,y,z)\,dwdxdydz \le \lambda(B_{b_i})\int_{\bar B_{b_{i-1}}} d^2(x)\,dx \lesssim \frac{b_{i-1}}{b_i}\,p^4 \ll p^4. \tag{4.16} \]

Thus, any one of the degree thresholds $b_i$ will satisfy the required statement (4.15) if, say,

\[ \int_{B_{b_i}\times\bar B_{b_i}\times(B_{b_{i-1}}\setminus B_{b_i})\times\bar B_{b_i}} U(w,x,y,z)\,dwdxdydz \le \tfrac12\varepsilon p^4. \tag{4.17} \]

The proof is concluded by noticing that there is necessarily some $2 \le i \le M$ satisfying (4.17), as
otherwise (since the sets $\{B_{b_i}\times\bar B_{b_i}\times(B_{b_{i-1}}\setminus B_{b_i})\times\bar B_{b_i} : 2 \le i \le M\}$ are mutually disjoint) we would
get $t(C_4,U) > (M\varepsilon/2)p^4 \ge Cp^4$, a contradiction to our choice of $C$ since $t(C_4,U) \le \mathbb{E}[U^2]^2$. □


Remark 4.3. Note the advantage of using multiple thresholds with $b_{i-1} \ll b_i$ in the proof of Lemma 4.2:
a single threshold function $b_i = b$ (as in §4.1) would have given a bound of $O(p^4)$ on the left-hand side
of (4.16) vs. the sought $o(p^4)$. This idea will be crucial in our arguments for general graphs.

Using $b$ from Lemma 4.2 for an arbitrarily small $\varepsilon$, the constraint in (4.14) becomes $\delta_2^2 + 2\delta_1^2 + 4\delta_1 \ge \delta$, and, as in (4.7), we conclude by Corollary 3.4 that

\[ \liminf_{p\to 0}\ \inf\Big\{ \frac{I_p(p+U)}{p^2 I_p(1)} : t(C_4, p+U) \ge (1+\delta)p^4 \Big\} \ \ge\ \min_{\delta_1,\delta_2 \ge 0} \Big\{ \delta_1 + \frac{\delta_2}{2} : \delta_2^2 + 2\delta_1^2 + 4\delta_1 \ge \delta \Big\} . \]

This minimum is clearly attained at either $\delta_1 = 0$ or $\delta_2 = 0$ by the next observation, which concludes the bound for $\phi(C_4,p,\delta)$ matching the one from Proposition 2.4 (recall that $P_{C_4^*}(x) = 1 + 4x + 2x^2$).

Observation 4.4. Let $f,g$ be convex nondecreasing functions on $[0,\infty)$ and let $a > 0$. The minimum of $x + y$ over the region $\{x, y \ge 0 : f(x) + g(y) \ge a\}$ is attained at either $x = 0$ or $y = 0$.

Proof. By convexity, if $\gamma = \frac{y}{x+y}$ then $f(x) \le \gamma f(0) + (1-\gamma) f(x+y)$ and $g(y) \le (1-\gamma) g(0) + \gamma g(x+y)$, so $f(x) + g(y) \le \gamma[f(0) + g(x+y)] + (1-\gamma)[f(x+y) + g(0)]$, i.e., at most $\max\{f(0) + g(x+y),\ f(x+y) + g(0)\}$. Thus, the sought minimum is either $\min\{z : f(0) + g(z) \ge a\}$ or $\min\{z : f(z) + g(0) \ge a\}$. □
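As a numerical sanity check of Observation 4.4 in the $C_4$ setting, the following minimal sketch compares a brute-force grid minimum of $\delta_1 + \delta_2/2$ under the constraint $\delta_2^2 + 2\delta_1^2 + 4\delta_1 \ge \delta$ with the smaller of the two boundary values. The function names, the grid resolution and the tolerance are illustrative assumptions, not part of the paper.

```python
import math

def lhs(d1, d2):
    # Left-hand side of the C_4 constraint: d2^2 + 2*d1^2 + 4*d1.
    return d2 ** 2 + 2 * d1 ** 2 + 4 * d1

def boundary_minimum(delta):
    # d2 = 0: positive root of 2x^2 + 4x = delta; d1 = 0: d2 = sqrt(delta).
    anti_clique = -1 + math.sqrt(1 + delta / 2)
    clique = math.sqrt(delta) / 2
    return min(anti_clique, clique)

def grid_minimum(delta, steps=200, tol=1e-12):
    # Brute-force search over a grid that contains both boundary optimisers.
    d1_max = -1 + math.sqrt(1 + delta / 2)
    d2_max = math.sqrt(delta)
    best = float("inf")
    for i in range(steps + 1):
        d1 = d1_max * (i / steps)
        for j in range(steps + 1):
            d2 = d2_max * (j / steps)
            if lhs(d1, d2) >= delta - tol:   # feasible up to rounding
                best = min(best, d1 + d2 / 2)
    return best

for delta in (6.0, 25.0):
    print(delta, grid_minimum(delta), boundary_minimum(delta))
```

For $\delta = 6$ the minimum sits on the side $\delta_2 = 0$, while for $\delta = 25$ it sits on the side $\delta_1 = 0$; the two boundary values coincide at $\delta = 16$.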


4.3. General graphs. The solution of the variational problem for a general graph $H$ is substantially more delicate and, perhaps surprisingly, involves matchings and the polynomial $P_{H^*}(x)$.

By (3.2), assume $W \ge p$ and let $W = U + p$. Expanding $t(H,W)$ in terms of $U$ (as in (4.1) and (4.8)),

\[ t(H,W) - p^{|E(H)|} = \sum_{\substack{F \subseteq H \\ E(F) \neq \emptyset}} N(F,H)\, t(F,U)\, p^{|E(H)| - |E(F)|} , \tag{4.18} \]

where $N(F,H)$ is the number of copies of $F$ in the labeled graph $H$ up to automorphism, i.e., $N(F,H) := \hom(F,H)/\mathrm{Aut}(F)$ with $\mathrm{Aut}(F)$ the size of the automorphism group of the graph $F$ (the subgraph $F \subseteq H$ consists of the edges $(x,y) \in E(H)$ involving $U(x,y)$ in the expansion of $t(H, p+U)$).


4.3.1. Main contributing subgraphs. As a first step towards identifying the subgraphs $F$ that have a non-negligible contribution to the sum in (4.18), we have the following simple observation.

Observation 4.5. Let $\Delta \ge 2$, and $H$ be as in Theorem 2.3 and $U$ a graphon satisfying (3.10). Then for any subgraph $F$ of $H$ with maximum degree $\Delta - 1$ we have $t(F,U) \ll p^{|E(F)|}$.

Proof. If $\Delta = 2$, then $F$ is a disjoint union of isolated edges, and by (3.9)

\[ t(F,U) = \mathbb{E}[U]^{|E(F)|} \lesssim p^{3|E(F)|/2} (\log(1/p))^{|E(F)|/2} \ll p^{|E(F)|} . \]

On the other hand, if $\Delta \ge 3$, then using Hölder's inequality (3.1) we get

\[ t(F,U) \le \mathbb{E}\big[U^{\Delta-1}\big]^{\frac{|E(F)|}{\Delta-1}} \le \mathbb{E}\big[U^2\big]^{\frac{|E(F)|}{\Delta-1}} \lesssim p^{\frac{\Delta |E(F)|}{\Delta-1}} \ll p^{|E(F)|} . \]
□


Having seen that $F \subseteq H$ with $\Delta(F) < \Delta(H)$ give a negligible contribution to (4.18), we next show that subgraphs $F$ with a vertex cover number $\tau(F) > |E(F)|/\Delta(F)$ are also ruled out in this context.

Recall that a vertex cover of $F$ is a subset $S \subseteq V(F)$ such that $V(F) \setminus S$ is an independent set, and $\tau(F)$ is the size of the minimum vertex cover of $F$ (thus by definition $\tau(F) \ge |E(F)|/\Delta(F)$ for any $F$). Further recall that the matching number $\nu(F)$ is the size of a maximum matching in the graph $F$; the fractional matching number, denoted $\nu^*(F)$, is the maximum value of $w(E(F)) := \sum_{e \in E(F)} w(e)$ over all weight functions $w : E(F) \to [0,1]$ such that $\sum_{e \ni v} w(e) \le 1$ for every vertex $v \in F$ (i.e., $\nu^*$ is the real-valued relaxation of the integer program $\nu$, as it allows $0 \le w(e) \le 1$ rather than $w(e) \in \{0,1\}$).


Lemma 4.6. If $F$ is a connected irregular graph with maximum degree $\Delta \ge 2$ and $\tau(F) > |E(F)|/\Delta$, then

\[ \nu^*(F) > |E(F)|/\Delta . \tag{4.19} \]

Proof. If $F$ is bipartite, by König's Minimax Theorem $\nu(F) = \tau(F)$, and as $\nu^*(F)$ is always sandwiched between these two, we have $\nu^*(F) = \tau(F) > |E(F)|/\Delta$ by our hypothesis, as required.

Otherwise, let $C$ be an odd cycle in $F$. Let $u_0$ be some vertex of degree $\deg(u_0) < \Delta$ (using that $F$ is irregular) and let $P = (u_0, \dots, u_r)$ ($r \ge 0$) be a shortest path in $F$ from $u_0$ to $C$ (using that $F$ is connected); finally, write the vertices of $C$ as $v_0, \dots, v_{2k}$ ($k \ge 1$) with $v_0 = u_r$. We now perturb $w \equiv 1/\Delta$ into $w'$ by adding $(-1)^j \varepsilon$ to $(u_j, u_{j+1})$ along the path, and adding $(-1)^{r+j} \varepsilon/2$ to $(v_j, v_{j+1})$ along the cycle ($v_{2k+1} = v_0$). Since $C$ is odd, $\sum_{e \ni v} w'(e) = \sum_{e \ni v} w(e)$ for all $v \ne u_0$; thus, $w'$ is legal as long as $\varepsilon \le 1/\Delta$. Finally, if $r$ is even then $w'(P) = w(P)$ and $w'(C) = w(C) + \varepsilon/2$, and if $r$ is odd then $w'(P) = w(P) + \varepsilon$ and $w'(C) = w(C) - \varepsilon/2$. Either way, $\nu^*(F) \ge w'(E(F)) = |E(F)|/\Delta + \varepsilon/2$. □
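Lemma 4.6 can be checked by brute force on small graphs. The sketch below is illustrative only: it relies on the standard fact that the fractional matching polytope has half-integral vertices, so it suffices to search weights in $\{0, 1/2, 1\}$; the helper names and the two example graphs are assumptions introduced here.

```python
from fractions import Fraction
from itertools import combinations, product

def vertex_cover_number(n, edges):
    # Smallest k such that some k-subset of vertices touches every edge.
    for k in range(n + 1):
        for subset in combinations(range(n), k):
            chosen = set(subset)
            if all(u in chosen or v in chosen for u, v in edges):
                return k

def fractional_matching_number(n, edges):
    # Exhaustive search over half-integral weights (assumed to suffice).
    values = (Fraction(0), Fraction(1, 2), Fraction(1))
    best = Fraction(0)
    for w in product(values, repeat=len(edges)):
        load = [Fraction(0)] * n
        for (u, v), x in zip(edges, w):
            load[u] += x
            load[v] += x
        if max(load) <= 1:          # every vertex carries total weight <= 1
            best = max(best, sum(w))
    return best

# Triangle 0-1-2 with a pendant edge at 0: connected, irregular, Delta = 3.
paw = [(0, 1), (1, 2), (2, 0), (0, 3)]
print(vertex_cover_number(4, paw), fractional_matching_number(4, paw))

# The 5-cycle is regular, so the lemma does not apply to it.
c5 = [(i, (i + 1) % 5) for i in range(5)]
print(vertex_cover_number(5, c5), fractional_matching_number(5, c5))
```

In the first example $\tau = 2 > 4/3 = |E|/\Delta$ and the fractional matching number is $2$, strictly above $|E|/\Delta$. In the second, $\tau = 3 > 5/2$ but $\nu^* = 5/2 = |E|/\Delta$, which suggests why irregularity is assumed.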


Corollary 4.7. Let $F$ be a fixed connected irregular graph with maximum degree $\Delta \ge 2$ such that $\tau(F) > |E(F)|/\Delta$. There exists some $\kappa > 0$ such that, as $p \to 0$,

\[ t(F,U) \lesssim p^{(1+\kappa)|E(F)|} \ll p^{|E(F)|} . \tag{4.20} \]

Proof. By (4.19), the function $w \equiv 1/\Delta$ is not a local maximum of the linear program defining $\nu^*(F)$.

If $\Delta = 2$, one can perturb $w \equiv 1/2$ into $w' : E(F) \to [0, 2/3]$, such that $\sum_{e \ni v} w'(e) \le 1$ for every $v$ and $\sum_{e \in E(F)} w'(e) \ge (1+\kappa)|E(F)|/2$ for some $\kappa > 0$. Using Hölder's inequality (3.1) with these weights,

\[ t(F,U) \le \prod_{e \in E(F)} \Big( \int_{[0,1]^2} U(x,y)^{1/w'(e)}\,dx\,dy \Big)^{w'(e)} \le \Big( \int_{[0,1]^2} U(x,y)^{3/2}\,dx\,dy \Big)^{\sum_e w'(e)} \lesssim p^{(1+\kappa)|E(F)|} , \]

where the last inequality used (3.6) followed by (3.5) to replace the integral by $O(p^\Delta) = O(p^2)$.

If $\Delta \ge 3$, one can perturb $w \equiv 1/\Delta$ into $w' : E(F) \to [0, 1/2]$, such that $\sum_{e \ni v} w'(e) \le 1$ for every $v$ and $\sum_{e \in E(F)} w'(e) \ge (1+\kappa)|E(F)|/\Delta$ for some $\kappa > 0$. Using Hölder's inequality (3.1) with these weights,

\[ t(F,U) \le \prod_{e \in E(F)} \Big( \int_{[0,1]^2} U(x,y)^{1/w'(e)}\,dx\,dy \Big)^{w'(e)} \le \Big( \int_{[0,1]^2} U(x,y)^2\,dx\,dy \Big)^{\sum_e w'(e)} \lesssim p^{(1+\kappa)|E(F)|} , \]

using (3.10) for the last transition. □

Observation 4.5 and Corollary 4.7 reduce (4.18) into the following key identity.

\[ t(H,W) - p^{|E(H)|} = \sum_{F \in \mathcal{F}_H} N(F,H)\, t(F,U)\, p^{|E(H)| - |E(F)|} + o\big(p^{|E(H)|}\big) , \tag{4.21} \]

where

\[ \mathcal{F}_H := \{ F \subseteq H : \Delta(F) = \Delta(H) \text{ and } \tau(F) = |E(F)|/\Delta(F) \} . \tag{4.22} \]

4.3.2. The role of the independence polynomial. In view of the above defined $\mathcal{F}_H$, observe the following.

Observation 4.8. A graph $F$ has $\tau(F) = |E(F)|/\Delta(F)$ iff it is bipartite and $\tau(F)$ is attained by an independent set $A$ (one of the two color classes) such that $\deg_F(v) = \Delta$ for all $v \in A$.

Consequently, in a connected graph $H$ with maximum degree $\Delta$, every $F \in \mathcal{F}_H \setminus \{H\}$ contains a unique independent set $A$ such that $\deg_H(v) = \Delta$ for all $v \in A$ (its size is thus $|E(F)|/\Delta$, and its uniqueness follows from the fact that $H$ is connected); similarly, $H$ itself admits 2 such sets if $H \in \mathcal{F}_H$ (i.e., if it is bipartite and regular), and none otherwise. Recalling Definition 1.1, it follows that

\[ P_{H^*}(x) = 1 + \sum_{F \in \mathcal{F}_H} N(F,H)\, x^{|E(F)|/\Delta} + \mathbf{1}\{H \in \mathcal{F}_H\}\, x^{|E(H)|/\Delta} . \tag{4.23} \]

In §5-§6 we will relate each term $t(F,U)$ in the right-hand side of (4.21) to $\theta^{|E(F)|/\Delta} p^{|E(F)|}$, where $\theta$ is the analog of $\delta_1$ from (4.4), thereby leading to $P_{H^*}(\theta)$ (just as $P_{C_4^*}(\delta_1)$ appeared in §4.2).
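To make the polynomial $P_{H^*}$ concrete, here is a minimal sketch that computes its coefficients by enumeration. It assumes, as recalled from Definition 1.1 (which is not reproduced in this section), that $H^*$ is the subgraph of $H$ induced on its maximum-degree vertices; the function name and the vertex encoding are illustrative choices.

```python
from itertools import combinations

def independence_polynomial_of_H_star(n, edges):
    """Coefficients [i(0), i(1), ...] of the independence polynomial of H*.

    Vertices are 0..n-1; H* is taken to be the subgraph induced on the
    vertices of maximum degree (an assumption about Definition 1.1).
    """
    degree = [0] * n
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    top = [v for v in range(n) if degree[v] == max(degree)]
    adjacent = {frozenset(e) for e in edges}
    coeffs = []
    for k in range(len(top) + 1):
        count = sum(
            1
            for subset in combinations(top, k)
            if all(frozenset(pair) not in adjacent
                   for pair in combinations(subset, 2))
        )
        if count == 0:      # no independent k-set, hence none of larger size
            break
        coeffs.append(count)
    return coeffs

c4 = [(0, 1), (1, 2), (2, 3), (3, 0)]
k12 = [(0, 1), (1, 2)]                      # path on three vertices
print(independence_polynomial_of_H_star(4, c4))   # coefficients of 1 + 4x + 2x^2
print(independence_polynomial_of_H_star(3, k12))  # coefficients of 1 + x
```

The first output agrees with the polynomial $1 + 4x + 2x^2$ quoted for the 4-cycle in §4.2.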

4.3.3. Outline of the proof of Theorems 2.2 and 2.3. A 2-matching $M$ of a graph $F$ is a union of 2 edge disjoint matchings of $F$ (thus it is a disjoint union of cycles and paths).^ Informally, the proof for a general graph $H$ is obtained as follows. For every vertex $v$ in the minimum vertex cover of $F$, we choose a 2-matching containing $v$.

• When $F$ is irregular, the 2-matching is chosen such that the component containing $v$ is a path (crucially, the existence of such a matching is guaranteed by Lemma 5.3). One can then show that the only embeddings of $F$ with a non-negligible contribution are those where alternating degree-2 vertices of the path are all in $B_b$ (see Definition 3.3) for a particular choice of $b$; see §5 for details.

• When $F$ is regular, the argument is broken into two cases depending on whether the graph is bipartite or not. If it is not bipartite, one essentially shows that the only embedding of $F$ with a non-negligible contribution is when all the vertices are in $\bar B_b$ (i.e., they have low degree). In case $F$ is bipartite with bipartition $(F_1, F_2)$, the proof proceeds by showing that for an embedding to be non-negligible, the vertices in $F_1$ (and similarly $F_2$) must all be in either $B_b$ or in $\bar B_b$ (e.g., Fig. 4). This is shown by a generalization of the argument for the 4-cycle applied on 2-matchings whose connected components are even cycles. To make these arguments work one needs to choose a degree threshold $b$ adaptively (as in §4.2, and unlike the irregular case), via a nested subsequence of degree thresholds for the different 2-matchings involving the vertices of $F$; see §6 for details.

As in the case of the 4-cycle, these arguments imply a lower bound on $\phi(H,p,\delta)$ through an application of Corollary 3.4, which matches the upper bound from Proposition 2.4 and concludes the proof.

5. Irregular connected graphs: proof of Theorem 2.3 

This section is devoted to the lower bound in Theorem 2.3 (the upper bound is by Proposition 2.4). The following combinatorial lemmas on 2-matchings will play a central role in our analysis further on.

^This differs slightly from the notion of 2-matchings in the literature (cf. [20]), the definition here being a special case.

5.1. 2-matchings. The edges of a bipartite graph $G$ of maximum degree $\Delta$ can be partitioned into $\Delta$ pairwise disjoint matchings by König's Edge Coloring Theorem (cf. [20, §1.4] for its proof via embedding $G$ in a regular graph); each one saturates every vertex of degree $\Delta$, thus Berge's criterion for maximum matchings via augmenting paths implies the following (see [20, §1] for more on these classical results).

Fact 5.1. In any bipartite graph with maximum degree A, there exists a maximum matching which 
saturates all the vertices of degree A. 


In what follows, let $K_1 \cup K_2 = (V(K_1) \cup V(K_2), E(K_1) \cup E(K_2))$ denote the graph union of $K_1, K_2$. In addition, recalling the definition of $\mathcal{F}_H$ in (4.22), upon referring to a bipartition of $F \in \mathcal{F}_H$ by $(F_A, F_B)$ we will let $F_A$ be an independent set with $\deg_F(v) = \Delta$ for all $v \in F_A$.

Observation 5.2. If $F \in \mathcal{F}_H$ is a bipartite graph with bipartition $(F_A, F_B)$, then $F$ has a 2-matching $M$ of size $2|E(F)|/\Delta$. Moreover if $v \in F_B$ has $\deg_F(v) < \Delta$ then $M$ can be chosen with $\deg_M(v) = 1$.

Proof. By König's Edge Coloring Theorem, $E(F)$ can be partitioned into matchings $\{M_1, \dots, M_\Delta\}$; as $F_A$ is an independent set with $\deg_F(u) = \Delta$ for all $u \in F_A$, we must have $|M_i| = |E(F)|/\Delta$ for all $i$, so $M_i \cup M_j$ (for any $i \ne j$) gives a 2-matching of size $2|E(F)|/\Delta$. Finally, if $v \in F_B$ has $\deg_F(v) < \Delta$ then there must exist $i, j$ such that $v \in M_i$ and $v \notin M_j$, as needed. □
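The construction in this proof can be tried out on a small example. The sketch below uses the classical alternating-path recolouring argument for bipartite edge colouring (a standard technique, not taken from the paper) and then unions two colour classes into a 2-matching; the function names and the example $K_{2,3}$ are illustrative assumptions.

```python
from collections import Counter

def bipartite_edge_coloring(edges, delta):
    """Properly colour the edges of a bipartite graph with colours 0..delta-1.

    Assumes the graph is bipartite with maximum degree at most delta.
    """
    at = {}  # at[v][c] = the neighbour joined to v by the edge of colour c
    for u, v in edges:
        at.setdefault(u, {})
        at.setdefault(v, {})
    colour = {}
    for u, v in edges:
        a = next(c for c in range(delta) if c not in at[u])  # free at u
        b = next(c for c in range(delta) if c not in at[v])  # free at v
        if a != b:
            # Walk the a/b alternating path that starts at v with colour a ...
            path, x, c = [], v, a
            while c in at[x]:
                y = at[x][c]
                path.append((x, y, c))
                x, c = y, (b if c == a else a)
            # ... and swap the two colours along it, which frees a at v.
            for x, y, c in path:
                del at[x][c], at[y][c]
            for x, y, c in path:
                d = b if c == a else a
                at[x][d], at[y][d] = y, x
                colour[frozenset((x, y))] = d
        at[u][a], at[v][a] = v, u
        colour[frozenset((u, v))] = a
    return colour

def two_matching(colour, i, j):
    # Union of the colour classes i and j.
    return [e for e, c in colour.items() if c in (i, j)]

# K_{2,3}: the side {a0, a1} plays the role of F_A (all degrees equal Delta = 3).
edges = [(f"a{i}", f"b{j}") for i in range(2) for j in range(3)]
colour = bipartite_edge_coloring(edges, delta=3)
class_sizes = Counter(colour.values())
M = two_matching(colour, 0, 1)
print(class_sizes, len(M))
```

Here $|E(F)|/\Delta = 2$, so each colour class should have 2 edges and the resulting 2-matching should have $2|E(F)|/\Delta = 4$ edges.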

The next key lemma removes the assumption $\deg_F(v) < \Delta$ from Observation 5.2 for an irregular $F$.

Lemma 5.3. Let $F \in \mathcal{F}_H$ be an irregular graph with bipartition $(F_A, F_B)$ and $\Delta \ge 3$. Then for every $v \in F_B$, the graph $F$ has a 2-matching $M$ of size $2|E(F)|/\Delta$ such that $\deg_M(v) = 1$.

Proof. We may assume $F$ is connected (by restricting attention to the connected component of $v$, which is irregular as $F \in \mathcal{F}_H$ and $H$ is connected so it cannot have a nontrivial $\Delta$-regular component, and applying Observation 5.2 to other components) as well as that $\deg_F(v) = \Delta$ (again by Observation 5.2).

Consider the graph $F'$ obtained from $F$ by deleting $v$ (and the edges incident to it). Every $S \subseteq F_A$ has at least $|S|$ neighbors in $F$ (as the degrees in $F_A$ are all $\Delta$), so $|N_{F'}(S)| \ge |S|$ unless $|N_F(S)| = |S|$, which in turn is impossible if $F$ is connected and irregular. By Hall's Theorem, $\nu(F') = |F_A| = |E(F)|/\Delta$, and by Fact 5.1, there is a maximum matching $M_1$ in $F'$ saturating every degree-$\Delta$ vertex in $F'$.

Let $F''$ be the graph obtained from $F$ by deleting $E(M_1)$ and the edge $(v,w)$, where $w$ is an arbitrarily chosen neighbor of $v$ in $F$ (recall that $\deg_{M_1}(v) = 0$). Clearly, the maximum degree in $F''$ is $\Delta - 1$ (which is the degree in $F''$ of $v$ and every vertex in $F_A$, except for $w$ that has degree $\Delta - 2$), and so

\[ \tau(F'') \ge \frac{|E(F'')|}{\Delta - 1} = \frac{(1 - \frac{1}{\Delta})|E(F)| - 1}{\Delta - 1} = |F_A| - \frac{1}{\Delta - 1} . \]

As $1/(\Delta - 1) \le 1/2$ (by the hypothesis $\Delta \ge 3$), $\tau(F'') \ge |F_A|$, but $F_A$ is a vertex cover (already in $F$), so König's Minimax Theorem together with Fact 5.1 imply that there is a maximum matching $M_2$ in $F''$ of size $|F_A| = |E(F)|/\Delta$ that saturates all the degree $\Delta - 1$ vertices, in particular including $v$.

The desired 2-matching $M$ is now obtained as the union $M_1 \cup M_2$. □


Corollary 5.4. Let $F \in \mathcal{F}_H$ be an irregular graph with bipartition $(F_A, F_B)$. For any $u \in F_A$ there is a 2-matching $M$ of size $2|E(F)|/\Delta$ in $F$ so that the connected component of $u$ in $M$ is a path.

Proof. If $\Delta = 2$ then there are no cycles in $F$, so the result follows from Observation 5.2; thus, assume $\Delta \ge 3$, let $v \in N_F(u)$, and let $M$ be the 2-matching from Lemma 5.3 where $\deg_M(v) = 1$, i.e., $v$ is the endpoint of a path $P$. If $u$ lies on a path in $M$ we are done; otherwise, it is on some cycle $C$ and $(u,v)$ is not part of $M$, so we may use it to merge $P$ and $C$, unravelling $C$ in the obvious way (see Fig. 5). □

Figure 5. Merging a cycle and a path to get a longer path.

The 2-matching obtained in Corollary 5.4 will be used throughout the paper to determine contributing embeddings of $t(F,U)$. To this end we have the following definition:

Definition 5.5. Let $M_F(u)$, for an irregular graph $F = (F_A, F_B) \in \mathcal{F}_H$ and $u \in F_A$, be a 2-matching of size $2|E(F)|/\Delta$ such that the connected component of $u$ in $M_F(u)$ is a path (as per Corollary 5.4).

5.2. Proof of Theorem 2.3. Using the 2-matching $M_F(u)$ from Definition 5.5 we can determine the embeddings of $F$ which contribute to $t(F,U)$, for $F \in \mathcal{F}_H$. Hereafter, let $x = (x_1, x_2, \dots, x_{|V(F)|})$ denote the vector of variables indexed by the vertices of $F$. For brevity, define the notation

\[ W(x|F) := \prod_{(i,j) \in E(F)} W(x_i, x_j) \quad\text{and}\quad dx(F) := \prod_{i \in V(F)} dx_i . \]

Recall the definition of $B_b$ from (3.3). The following proposition shows that embeddings of $F \in \mathcal{F}_H$ which contribute to $t(F,U)$ must have all the vertices of $F_A$ in $B_b$, for some appropriately chosen $b$.

Proposition 5.6. Suppose $F \in \mathcal{F}_H$ with bipartition $(F_A, F_B)$. For any $\varepsilon > 0$, $\gamma_0 < 1/3$, and $b = p^{\gamma_0}$,

\[ \int_{[0,1]^{|V(F)|}} U(x|F)\, \mathbf{1}\{\exists v \in F_A : x_v \in \bar B_b\}\, dx(F) \lesssim \varepsilon p^{|E(F)|} \tag{5.1} \]

holds for any graphon $U$ such that $\mathbb{E}[U^2] \lesssim p^\Delta$ (as per (3.10)).

The proof of the above proposition is postponed to §5.3. We first show how to complete the proof of Theorem 2.3 using this proposition.

Proof of Theorem 2.3 assuming Proposition 5.6. The next lemma shows that the contribution to $t(F,U)$ from embeddings where $x_i, x_j \in B_b$ for $(i,j) \in E(F)$ is negligible.

Lemma 5.7. Let $F \in \mathcal{F}_H$ and $U$ be such that $\mathbb{E}[U^2] \lesssim p^\Delta$ (as per (3.10)). For any $p^{1/3} \ll b \ll 1$,

\[ \int_{[0,1]^{|V(F)|}} U(x|F)\, \mathbf{1}\{\exists (i,j) \in E(F) \text{ such that } x_i, x_j \in B_b\}\, dx(F) \ll p^{|E(F)|} . \tag{5.2} \]

Proof. It suffices to show (5.2) for a fixed edge $(i,j) \in E(F)$. As argued in the proof of Observation 5.2, $F$ is a disjoint union of $\Delta$ matchings of size $|E(F)|/\Delta$, thus there exists a 2-matching $M$ of $F$ of size $2|E(F)|/\Delta$ such that $(i,j) \in E(M)$. Let $M_{\setminus ij}$ be the graph obtained by removing all the edges incident to the vertices $i,j$. Clearly, the number of edges removed is at least 2 and at most 3, since we cannot have $\deg_M(i) = \deg_M(j) = 1$ (recall that $\deg_F(i) = \Delta$ or $\deg_F(j) = \Delta$). Hence,

\[ \int_{[0,1]^{|V(M)|}} U(x|M)\, \mathbf{1}\{x_i, x_j \in B_b\}\, dx(M) \le \lambda(B_b)^2 \int U(x|M_{\setminus ij})\, dx(M_{\setminus ij}) \lesssim \frac{p^{2\Delta}}{b^2}\, p^{\frac{\Delta(|E(M)| - 3)}{2}} = o\big(p^{\frac{\Delta |E(M)|}{2}}\big) . \]

The second inequality is by (3.7), (3.10) and Lemma 3.1, while the last equality uses the assumption $b \gg p^{1/3}$, since $\Delta \ge 2$. Finally, since $U(x|F) \le U(x|M)$, the result follows. □


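Let $\gamma_0 = 1/4$ and $b_0 = p^{\gamma_0} = p^{1/4}$. By Proposition 5.6, for all $F \in \mathcal{F}_H$,

\[ \int_{[0,1]^{|V(F)|}} U(x|F)\, \mathbf{1}\{\exists v \in F_A : x_v \in \bar B_{b_0}\}\, dx(F) \le \frac{\varepsilon}{\gamma(H)}\, p^{|E(F)|} , \]

where $\gamma(H) = \sum_{F \in \mathcal{F}_H} N(F,H)$ (absorbing the implicit constant from (5.1) in $\varepsilon$). This implies that

\[ \sum_{F \in \mathcal{F}_H} N(F,H)\, p^{|E(H)| - |E(F)|} \int_{[0,1]^{|V(F)|}} U(x|F)\, \mathbf{1}\{\exists v \in F_A : x_v \in \bar B_{b_0}\}\, dx(F) \le \varepsilon p^{|E(H)|} . \tag{5.3} \]

Now, let

\[ \theta_{b_0} p^\Delta := \int_{B_{b_0} \times \bar B_{b_0}} U^2(x,y)\,dx\,dy , \qquad \theta'_{b_0} p^\Delta := \int_{\bar B_{b_0} \times \bar B_{b_0}} U^2(x,y)\,dx\,dy . \tag{5.4} \]

Set $\theta = \theta_{b_0}$ and $\theta' = \theta'_{b_0}$. Let $\delta_F$ be such that $t(F,U) = \delta_F p^{|E(F)|}$ for $F \in \mathcal{F}_H$. Combining (5.2) and (5.3) it follows that for $F \in \mathcal{F}_H$ with bipartition $(F_A, F_B)$, the contribution to $t(F,U)$ comes from embeddings where every vertex in $F_A$ is in $B_{b_0}$ and every vertex in $F_B$ is in $\bar B_{b_0}$. That is,

\[ \Big(\delta_F - \frac{\varepsilon}{\gamma(H)}\Big) p^{|E(F)|} - o\big(p^{|E(F)|}\big) \le \int_{[0,1]^{|V(F)|}} U(x|F)\, \mathbf{1}\{x_u \in B_{b_0}\ \forall u \in F_A,\ x_v \in \bar B_{b_0}\ \forall v \in F_B\}\, dx(F) \le p^{|E(F)|}\, \theta^{|E(F)|/\Delta} , \tag{5.5} \]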

where the last step uses Hölder's inequality (3.1). Therefore, from (4.21) and the definition of $\gamma(H)$,

\[ (\delta - \varepsilon)\, p^{|E(H)|} - o\big(p^{|E(H)|}\big) = p^{|E(H)|} \sum_{F \in \mathcal{F}_H} N(F,H) \Big(\delta_F - \frac{\varepsilon}{\gamma(H)}\Big) - o\big(p^{|E(H)|}\big) \le p^{|E(H)|} \sum_{F \in \mathcal{F}_H} N(F,H)\, \theta^{|E(F)|/\Delta} = p^{|E(H)|} \big(P_{H^*}(\theta) - 1\big) , \tag{5.6} \]

where the inequality used (5.5), and the last equality is by (4.23). It thus follows that $P_{H^*}(\theta) \ge 1 + \delta - \varepsilon - o(1) \ge 1 + \delta - 2\varepsilon$, for $p$ small enough. Moreover, by Corollary 3.4,

\[ \mathbb{E}[I_p(p+U)] \ge \big(\theta + \theta'/2 - o(1)\big)\, p^\Delta \log(1/p) . \]

By the above discussion, since $\varepsilon$ was arbitrary,

\[ \liminf_{p \to 0}\ \inf\Big\{ \frac{I_p(p+U)}{p^\Delta I_p(1)} : t(H, p+U) \ge (1+\delta)\, p^{|E(H)|} \Big\} \ \ge\ \min_{\theta \in S(H),\ \theta' \ge 0} \big( \theta + \theta'/2 \big) , \]

where $S(H) := \{\theta \ge 0 : P_{H^*}(\theta) \ge 1 + \delta\}$. Since the constraint in the definition of $S(H)$ does not involve $\theta'$, the minimum in the last expression is attained when $\theta' = 0$. Thus, since $\varepsilon$ is arbitrary,

\[ \liminf_{p \to 0}\ \inf\Big\{ \frac{I_p(p+U)}{p^\Delta I_p(1)} : t(H, p+U) \ge (1+\delta)\, p^{|E(H)|} \Big\} \ \ge\ \theta \]

for $\theta = \theta(H,\delta)$ the unique positive solution to $P_{H^*}(\theta) = 1 + \delta$, completing the proof of Theorem 2.3. □

Figure 6. (a) Thick edges (colored blue and green) form a 2-matching of size $2|E(F)|/\Delta = 8$ for the graph $F$. The path $M_1$ is the connected component of $v$. (b) The alternating degree-2 vertices of the path $M_1$ are labeled $1, 2, \dots$, starting with the vertex 1 which is the neighbor of one of the endpoints (labeled 0).
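The constant $\theta(H,\delta)$ can be approximated numerically once the coefficients of $P_{H^*}$ are known. The following is a minimal bisection sketch; the function names, the coefficient-list encoding and the tolerance are assumptions made for illustration, and it relies only on $P_{H^*}$ being increasing on $[0,\infty)$ with constant term 1.

```python
def evaluate(coeffs, x):
    # coeffs[k] is the number of independent sets of size k in H*.
    return sum(c * x ** k for k, c in enumerate(coeffs))

def solve_theta(coeffs, delta, tol=1e-12):
    """Positive solution of P(theta) = 1 + delta, found by bisection."""
    lo, hi = 0.0, 1.0
    while evaluate(coeffs, hi) < 1 + delta:   # bracket the root
        hi *= 2
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if evaluate(coeffs, mid) < 1 + delta:
            lo = mid
        else:
            hi = mid
    return hi

# K_{1,2}: H* is a single vertex, P(x) = 1 + x, so theta equals delta.
print(solve_theta([1, 1], 3.0))
# 1 + 4x + 2x^2 (the 4-cycle polynomial from Section 4.2) with delta = 6.
print(solve_theta([1, 4, 2], 6.0))
```

For the 4-cycle, which is regular, this value is only one of the two candidates; the other comes from the clique graphon, as discussed in §6.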


5.3. Proof of Proposition 5.6. Fix $v \in F_A$. Consider the 2-matching $M_F(v)$ from Definition 5.5. Let $M_1, M_2, \dots, M_q$ be the connected components of $M_F(v)$, such that $M_1$ (the connected component of $M_F(v)$ containing $v$) is a path (see Figure 6(a)). By Hölder's inequality (3.1) and by (3.5),

\[ t(M_j, U) \le \big(\mathbb{E}[U^2]\big)^{\frac{|E(M_j)|}{2}} \lesssim p^{\frac{\Delta |E(M_j)|}{2}} , \tag{5.7} \]

for all $1 \le j \le q$.

To show (5.1) it suffices to prove that

\[ \int_{[0,1]^{|V(F)|}} U(x|F)\, \mathbf{1}\{x_v \in \bar B_b\}\, dx(F) \lesssim \varepsilon p^{|E(F)|} , \tag{5.8} \]

for every vertex $v \in F_A$, under the same assumptions of Proposition 5.6 (note that the choice of $b$ does not depend on the vertex $v$). Since $U(x|F) \le U(x|M_F(v))$, the inequality (5.8) holds whenever

\[ \int_{[0,1]^{|V(M_F(v))|}} U(x|M_F(v))\, \mathbf{1}\{x_v \in \bar B_b\}\, dx(M_F(v)) \lesssim \varepsilon p^{\frac{\Delta |E(M_F(v))|}{2}} . \tag{5.9} \]

Now, by (5.7), $t(M_j, U) \lesssim p^{\frac{\Delta |E(M_j)|}{2}}$ for every $2 \le j \le q$; thus, (5.9) would follow from showing that

\[ \int_{[0,1]^{|V(M_1)|}} U(x|M_1)\, \mathbf{1}\{x_v \in \bar B_b\}\, dx(M_1) \lesssim \varepsilon p^{\frac{\Delta |E(M_1)|}{2}} \tag{5.10} \]

holds for $b = b(p)$ such that $p^{\gamma_0} \le b \ll 1$ for $\gamma_0 < 1/3$.

It therefore remains to prove (5.10). To this end, let $\beta = |E(M_1)|$ and $\beta_A = |V(M_1) \cap F_A|$. Since $M_1$ is a path, $|V(M_1)| = \beta + 1$. Label the vertices of $F_A \cap V(M_1)$ with $\{1, 2, \dots, \beta_A\}$, such that vertex 1 is the neighbor of one of the endpoints of $M_1$ (see Figure 6(b)).

Definition 5.8. For $k \in [\beta_A]$ and sets $A_1, A_2, \dots, A_k \subseteq [0,1]$, let

\[ t_k(A_1, A_2, \dots, A_k) = \int_{[0,1]^{\beta+1}} U(x|M_1)\, \mathbf{1}\{x_1 \in A_1, x_2 \in A_2, \dots, x_k \in A_k\}\, dx(M_1) , \]

be the contribution to $t(M_1, U)$ from the embeddings of $M_1$ where the vertex labeled $i$ is in $A_i$ for $i \in [k]$.


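The main step in the proof of Proposition 5.6 is the following lemma. The $\varepsilon$ dependence in the term $o(p^{\Delta\beta/2})$ in (5.11) is suppressed and will be clear from the proof.

Lemma 5.9. Let $U$ be any graphon satisfying $\mathbb{E}[U^2] \lesssim p^\Delta$ (as per (3.10)) and $t(M_1, U) = \delta_\beta\, p^{\Delta\beta/2}$. For any $\varepsilon > 0$, $\gamma_0 < 1/3$, and $b_0 = p^{\gamma_0}$,

\[ t_{\beta_A}(B_{b_0}, B_{b_0}, \dots, B_{b_0}) \ge (\delta_\beta - \varepsilon)\, p^{\Delta\beta/2} - o\big(p^{\Delta\beta/2}\big) . \tag{5.11} \]

This will immediately imply Proposition 5.6, since choosing $b = b_0 = p^{\gamma_0}$ in Lemma 5.9 yields

\[ \int U(x|M_1)\, \mathbf{1}\{x_v \in \bar B_b\}\, dx(M_1) \le \int U(x|M_1)\, \mathbf{1}\{\exists s \in [\beta_A] : x_s \in \bar B_b\}\, dx(M_1) = \delta_\beta\, p^{\Delta\beta/2} - t_{\beta_A}(B_b, B_b, \dots, B_b) \lesssim \varepsilon p^{\Delta\beta/2} . \]

Proof of Lemma 5.9. Recalling (5.7), fix $C$ throughout this proof such that $t(M_1, U) \le C p^{\Delta\beta/2}$. We will show that there exist $b_1 \gg b_2 \gg \dots \gg b_{\beta_A}$ such that, for all $r \in [\beta_A]$,

\[ t_r(B_{b_r}, B_{b_r}, \dots, B_{b_r}) \ge (\delta_\beta - \varepsilon r/\beta_A)\, p^{\Delta\beta/2} - o\big(p^{\Delta\beta/2}\big) . \tag{5.12} \]

By choosing $r = \beta_A$ above, $t_{\beta_A}(B_{b_{\beta_A}}, \dots, B_{b_{\beta_A}}) \ge (\delta_\beta - \varepsilon)\, p^{\Delta\beta/2} - o(p^{\Delta\beta/2})$, and by monotonicity $t_{\beta_A}(B_{b_0}, \dots, B_{b_0}) \ge t_{\beta_A}(B_{b_{\beta_A}}, \dots, B_{b_{\beta_A}})$ (since $b_{\beta_A} \ge b_0 = p^{\gamma_0}$), which implies (5.11).

It remains to show (5.12); we proceed by induction on $r$. Let $M_1^-$ be the path with $\beta - 1$ edges obtained by deleting the neighbor of vertex 1 in $M_1$ (along with its edge). For $r = 1$ and any $b_1 \ll 1$,

\[ t_1(\bar B_{b_1}) = \int_{[0,1]^{\beta}} d(x_1)\, \mathbf{1}\{x_1 \in \bar B_{b_1}\}\, U(x|M_1^-)\, dx(M_1^-) \le \Big( \int_{\bar B_{b_1}} d^2(x_1)\,dx_1 \Big)^{1/2} \Big( \int_{[0,1]^2} U^2(x,y)\,dx\,dy \Big)^{(\beta-1)/2} \lesssim \sqrt{b_1}\, p^{\Delta\beta/2} \ll p^{\Delta\beta/2} , \tag{5.13} \]

where in the first equality we integrated out the variable $x_0$ (corresponding to the neighbor of vertex 1 in $M_1$ which has degree 1) from $U(x_0, x_1)$, and the second step used Hölder's inequality (3.1). Therefore, $t_1(B_{b_1}) = (\delta_\beta - o(1))\, p^{\Delta\beta/2}$, whenever $b_1 \ll 1$.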

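Now, assuming that (5.12) holds for all $r \in [k]$, we will extend it to $r = k+1$. Let $b', b'' \in [0,1]$, and consider the graph obtained by removing the two edges of $M_1$ incident to the vertex $k$ (Figure 6(b)), which is a disjoint union of two paths. Let $L_1$ be the path which contains the vertex $k+1$, and $L_2$ the other path. Then integrating over the neighbor of $k+1$ in $L_1$, and using Hölder's inequality (3.1),

\[ \int_{[0,1]^{|V(L_1)|}} U(x|L_1)\, \mathbf{1}\{x_{k+1} \in \bar B_{b'}\}\, dx(L_1) \le \Big( \int_{\bar B_{b'}} d^2(x_{k+1})\,dx_{k+1} \Big)^{1/2} \big(\mathbb{E}[U^2]\big)^{\frac{|E(L_1)| - 1}{2}} . \tag{5.14} \]

Furthermore, by Hölder's inequality (3.1), $t(L_2, U) \le (\mathbb{E}[U^2])^{|E(L_2)|/2}$, which together with (5.14) above gives (since $|E(L_1)| + |E(L_2)| = \beta - 2$)

\[ t_{k+1}([0,1], \dots, [0,1], B_{b''}, \bar B_{b'}) \le \lambda(B_{b''}) \Big( \int_{\bar B_{b'}} d^2(x_{k+1})\,dx_{k+1} \Big)^{1/2} \big(\mathbb{E}[U^2]\big)^{\frac{\beta - 3}{2}} \lesssim \frac{\sqrt{b'}}{b''}\, p^{\Delta\beta/2} , \tag{5.15} \]

where the last inequality uses (3.10) and (3.8). The above quantity is $o(p^{\Delta\beta/2})$ whenever $b' \ll b''^2$.

Define $M := \beta_A \lceil C/\varepsilon \rceil$. Given $b_k$, consider the following quantities for $a \in \{0, 1, \dots, M\}$:

\[ D_a := t_{k+1}\big([0,1], \dots, [0,1], B_{b_k^{3^a}}, \bar B_{b_k^{3^a}}\big) - t_{k+1}\big([0,1], \dots, [0,1], B_{b_k^{3^a}}, \bar B_{b_k^{3^{a+1}}}\big) . \tag{5.16} \]

Since $D_a$ integrates the variable $x_{k+1}$ over the set $B_{b_k^{3^{a+1}}} \setminus B_{b_k^{3^a}}$, and these sets are disjoint for $0 \le a \le M$, we get $C p^{\Delta\beta/2} \ge t(M_1, U) \ge \sum_{a=0}^{M} D_a$. Therefore, there exists $a_0 \in \{0, 1, \dots, M\}$ such that $D_{a_0} \le \frac{\varepsilon}{\beta_A}\, p^{\Delta\beta/2}$. Let $b_{k+1} = b_k^{3^{a_0}}$. Thus,

\[ t_{k+1}(B_{b_{k+1}}, \dots, B_{b_{k+1}}, \bar B_{b_{k+1}}) \le t_{k+1}([0,1], \dots, [0,1], B_{b_{k+1}}, \bar B_{b_{k+1}}) \le \frac{\varepsilon}{\beta_A}\, p^{\Delta\beta/2} + O\big(p^{\Delta\beta/2} \sqrt{b_1}\big) = \frac{\varepsilon}{\beta_A}\, p^{\Delta\beta/2} + o\big(p^{\Delta\beta/2}\big) , \tag{5.17} \]

where the last inequality used (5.15) after adding and subtracting $t_{k+1}([0,1], \dots, [0,1], B_{b_{k+1}}, \bar B_{b_{k+1}^3})$. Moreover, $t_k(B_{b_{k+1}}, \dots, B_{b_{k+1}}) \ge t_k(B_{b_k}, \dots, B_{b_k})$, since $b_{k+1} \le b_k$. By this and (5.17),

\[ t_{k+1}(B_{b_{k+1}}, \dots, B_{b_{k+1}}) = t_k(B_{b_{k+1}}, \dots, B_{b_{k+1}}) - t_{k+1}(B_{b_{k+1}}, \dots, B_{b_{k+1}}, \bar B_{b_{k+1}}) \ge t_k(B_{b_k}, \dots, B_{b_k}) - \frac{\varepsilon}{\beta_A}\, p^{\Delta\beta/2} - o\big(p^{\Delta\beta/2}\big) , \]

which, by the induction hypothesis, is at least

\[ \Big(\delta_\beta - \frac{k\varepsilon}{\beta_A}\Big) p^{\Delta\beta/2} - \frac{\varepsilon}{\beta_A}\, p^{\Delta\beta/2} - o\big(p^{\Delta\beta/2}\big) \ge \Big(\delta_\beta - \frac{(k+1)\varepsilon}{\beta_A}\Big) p^{\Delta\beta/2} - o\big(p^{\Delta\beta/2}\big) . \]

Notice that in (5.13) and subsequently all the $o(p^{\Delta\beta/2})$ terms are $O(p^{\Delta\beta/2} \sqrt{b_1})$. Also, observe that in the above proof $b_{\beta_A}$ is obtained by raising $b_1$ to a large power depending on $\varepsilon$ (see $M$, defined right before (5.16)). Thus, to ensure that $b_{\beta_A} \ge p^{\gamma_0}$ one can choose $b_1$ to be $p$ raised to a sufficiently small power as a function of $\varepsilon$, which proves (5.12). □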
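The pigeonhole step behind the choice of $b_{k+1}$ can be illustrated in isolation. The sketch below is schematic: the list `D` stands for the normalized quantities $D_a / p^{\Delta\beta/2}$, which in the proof are integrals and are replaced here by hypothetical numbers, and the value $M = \beta_A \lceil C/\varepsilon \rceil$ follows the reading of the proof used above. All function names are illustrative.

```python
import math

def number_of_scales(C, eps, beta_A):
    # M = beta_A * ceil(C / eps); the proof then uses the scales a = 0, ..., M.
    return beta_A * math.ceil(C / eps)

def pick_scale(D, eps, beta_A):
    """Index of the first scale whose contribution is at most eps / beta_A.

    If the nonnegative entries of D sum to at most C and len(D) = M + 1,
    the smallest entry is at most C / (M + 1) <= eps / beta_A, so such an
    index exists.
    """
    for a, value in enumerate(D):
        if value <= eps / beta_A:
            return a
    raise ValueError("no admissible scale: the entries of D sum to more than C")

def next_threshold(b_k, a0):
    # b_{k+1} = b_k ** (3 ** a0), a threshold no larger than b_k when b_k < 1.
    return b_k ** (3 ** a0)

C, eps, beta_A = 1.0, 0.5, 2
M = number_of_scales(C, eps, beta_A)
D = [0.3, 0.3, 0.3, 0.05, 0.05]      # hypothetical values, M + 1 entries
a0 = pick_scale(D, eps, beta_A)
print(M, a0, next_threshold(0.5, a0))
```

Since the exponent $3^{a_0}$ can be as large as $3^M$, the threshold shrinks very quickly, which is why $b_1$ has to be a sufficiently small power of $p$ for $b_{\beta_A}$ to stay above $p^{\gamma_0}$.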


6. Regular graphs: proof of Theorem 2.2

Here we prove the lower bound in Theorem 2.2 (the upper bound is by Proposition 2.4). Recall the key identity (4.21); the terms for $F \in \mathcal{F}_H \setminus \{H\}$ will be treated as in §5, and our main focus will be $t(H,U)$.

6.1. Embeddings of $H$ and adaptive choice of the degree threshold. Recall that $H$ is a connected $\Delta$-regular graph. To determine the embeddings of $H$ which contribute to $t(H,U)$ we consider two cases: (a) $\tau(H) > |E(H)|/\Delta$, and (b) $\tau(H) = |E(H)|/\Delta$.

We begin with the case where $H$ has minimum vertex cover of size greater than $|E(H)|/\Delta$. By Observation 4.8, $H$ is not bipartite. In this case we show that the only non-negligible contribution to $t(H,U)$ comes from the embedding where all the vertices are in $\bar B_b$, for some appropriately chosen $b$.

Lemma 6.1. Let $H$ be a $\Delta$-regular connected graph such that $\tau(H) > |E(H)|/\Delta$. Then there exists $\kappa := \kappa(H)$, such that for all $b \gg p^{\min\{\kappa, 1/3\}}$ and $U$ satisfying $\mathbb{E}[U^2] \lesssim p^\Delta$ (as per (3.10)),

\[ \int_{[0,1]^{|V(H)|}} U(x|H)\, \mathbf{1}\{\exists j \in V(H) : x_j \in B_b\}\, dx(H) \ll p^{|E(H)|} . \]

Proof. Fix $v \in V(H)$. Clearly it suffices to show that

\[ S_v := \int_{[0,1]^{|V(H)|}} U(x|H)\, \mathbf{1}\{x_v \in B_b\}\, dx(H) \ll p^{|E(H)|} . \]

Let $E_v$ be the set of edges in $E(H)$ incident to the vertex $v$, and $H_v = (V(H) \setminus \{v\}, E(H) \setminus E_v)$.

(i) If $H = K_{\Delta+1}$ then $H_v = K_\Delta$. Let $C_\Delta$ be the cycle of length $\Delta$. Then

\[ S_v \le \lambda(B_b)\, t(K_\Delta, U) \le \lambda(B_b)\, t(C_\Delta, U) \lesssim \frac{p^\Delta}{b}\, p^{\Delta^2/2} \ll p^{\frac{\Delta(\Delta+1)}{2}} , \]

whenever $b \gg p^{\Delta/2}$. (If $\Delta = 2$, the above argument fails as the above bound gives $t(C_2, U) \lesssim p$, which is not enough. For this special case we recall the argument in [23, (3.6)],

\[ S_v \le \lambda(B_b)\, \mathbb{E}[U] \lesssim b^{-1} p^{7/2} \sqrt{\log(1/p)} \ll p^3 , \]

where the second inequality follows from (3.9) and the last one by the assumption $b \gg p^{1/3}$.)

(ii) Otherwise, since $H \ne K_{\Delta+1}$, $H_v$ is an irregular graph of maximum degree $\Delta$. Moreover $\tau(H_v) > |E(H)|/\Delta - 1$, since otherwise adding $v$ to the vertex cover would mean that $H$ has a vertex cover of size $|E(H)|/\Delta$, which contradicts the hypothesis. Therefore, removing the edges incident to $v$ and using the same argument as in Corollary 4.7, we get, for some $\kappa := \kappa(H)$,

\[ S_v \le \lambda(B_b) \int \prod_{(i,j) \in E(H_v)} U(x_i, x_j)\, dx(H_v) \lesssim \frac{p^{|E(H)| + \kappa}}{b} , \]

since $\lambda(B_b) \lesssim p^\Delta / b$. Now, since $b \gg p^\kappa$, the result follows. □


Now assume that $H$ is $\Delta$-regular and has a vertex cover of size $|E(H)|/\Delta$, which implies that $H$ is a bipartite graph with an equipartition $(H_A, H_B)$.

Proposition 6.2. Let $H$ be a $\Delta$-regular graph with $\tau(H) = |E(H)|/\Delta$. Given $\varepsilon > 0$ and $U$ satisfying $\mathbb{E}[U^2] \lesssim p^\Delta$ (as per (3.10)) there exists some $b_H = p^{\gamma(\varepsilon, H, U)} \gg p^{1/3}$ such that

\[ \int_{[0,1]^{|V(H)|}} U(x|H)\, \mathbf{1}\{D_{b_H}\}\, dx(H) \le \varepsilon p^{|E(H)|} , \]

where $D_b = \{\exists u, w \in H_A : x_u \in B_b,\ x_w \in \bar B_b\}$.

The proof of the above proposition is postponed to §6.2. We first show how to complete the proof of Theorem 2.2 using this proposition.

Proof of Theorem 2.2 assuming Proposition 6.2. The next lemma shows that the contribution to $t(H,U)$ from embeddings where $x_i, x_j \in B_b$ for $(i,j) \in E(H)$ is negligible; we omit the proof as it is identical to that of Lemma 5.7.

Lemma 6.3. Suppose $H$ is $\Delta$-regular with $\tau(H) = |E(H)|/\Delta$. For any $p^{1/3} \ll b \ll 1$,

\[ \int_{[0,1]^{|V(H)|}} U(x|H)\, \mathbf{1}\{\exists (i,j) \in E(H) \text{ such that } x_i, x_j \in B_b\}\, dx(H) \ll p^{|E(H)|} , \]

where $U$ is any graphon satisfying (3.10).

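Let $b_H$ be as in Proposition 6.2, and define $\theta = \theta_{b_H}$ and $\theta' = \theta'_{b_H}$ as in (5.4). Lemma 6.1 and Proposition 6.2 (together with Hölder's inequality (3.1)) will imply the following result.

Corollary 6.4. Fix $\varepsilon > 0$. Let $H$ be a $\Delta$-regular graph and $U$ be any graphon satisfying (3.10). Then

\[ t(H,U) - \varepsilon p^{|E(H)|} - o\big(p^{|E(H)|}\big) \le 2\theta^{|E(H)|/\Delta}\, \mathbf{1}\{H \text{ bipartite}\}\, p^{|E(H)|} + \theta'^{\,|E(H)|/\Delta}\, p^{|E(H)|} . \tag{6.1} \]

Proof. Consider the two cases depending on whether or not $H$ is bipartite:

(i) $H$ is non-bipartite: by Observation 4.8, $\tau(H) > |E(H)|/\Delta$ and by Lemma 6.1, the only non-negligible embedding places all vertices of $H$ in $\bar B_{b_H}$. Hence, by Hölder's inequality (3.1),

\[ t(H,U) \le \Big( \int_{\bar B_{b_H} \times \bar B_{b_H}} U^2(x,y)\,dx\,dy \Big)^{|E(H)|/\Delta} + o\big(p^{|E(H)|}\big) = \theta'^{\,|E(H)|/\Delta}\, p^{|E(H)|} + o\big(p^{|E(H)|}\big) . \]

(ii) $H$ is bipartite with equipartition $(H_A, H_B)$: by Observation 4.8, $\tau(H) = |E(H)|/\Delta$. Now, by Proposition 6.2, up to an error of $\varepsilon p^{|E(H)|}$ all the contribution to $t(H,U)$ comes from embeddings where all the vertices of $H_A$ are either in $B_{b_H}$ or in $\bar B_{b_H}$, and similarly for all vertices in $H_B$. To this end, consider the following integrals:

\[ \Gamma_1 = \int_{[0,1]^{|V(H)|}} U(x|H)\, \mathbf{1}\{x_u \in B_{b_H},\ \forall u \in H_A\}\, dx(H) , \]
\[ \Gamma_2 = \int_{[0,1]^{|V(H)|}} U(x|H)\, \mathbf{1}\{x_u \in B_{b_H},\ \forall u \in H_B\}\, dx(H) , \]
\[ \Gamma_3 = \int_{[0,1]^{|V(H)|}} U(x|H)\, \mathbf{1}\{x_u \in \bar B_{b_H},\ \forall u \in V(H)\}\, dx(H) . \]

By the above discussion and Hölder's inequality (3.1),

\[ t(H,U) - \varepsilon p^{|E(H)|} - o\big(p^{|E(H)|}\big) \le \Gamma_1 + \Gamma_2 + \Gamma_3 \le \big( 2\theta^{|E(H)|/\Delta} + \theta'^{\,|E(H)|/\Delta} \big)\, p^{|E(H)|} . \]

We have thus established (6.1). □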


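Now, let $\delta_F$ be such that $t(F,U) = \delta_F\, p^{|E(F)|}$ for $F \in \mathcal{F}_H \cup \{H\}$. Using (4.21) we can write

\[ \delta p^{|E(H)|} \le t(H,W) - p^{|E(H)|} = p^{|E(H)|} \sum_{F \in \mathcal{F}_H \cup \{H\}} \delta_F\, N(F,H) + o\big(p^{|E(H)|}\big) . \]

Since $H$ is connected, any $F \in \mathcal{F}_H \setminus \{H\}$ must be irregular. Recall $b_H = p^{\gamma(\varepsilon,H,U)} \gg p^{1/3}$ from Proposition 6.2, and notice that the proof of that proposition in fact allowed for $b_H \gg p^{\min\{\alpha, 1/3\}}$, for any fixed $\alpha > 0$. In particular, taking $\alpha = \kappa := \kappa(H)$ as in Lemma 6.1, we get $b_H \gg p^{\min\{\kappa, 1/3\}}$. By choosing $\gamma_0 = \gamma(\varepsilon, H, U)$ in Lemma 5.9, and using (5.2) and (5.3) it follows that for $F \in \mathcal{F}_H \setminus \{H\}$, with bipartition $(F_A, F_B)$, the contribution to $t(F,U)$ comes from embeddings where every vertex in $F_A$ is in $B_{b_H}$ and every vertex in $F_B$ is in $\bar B_{b_H}$. Arguments identical to those in (5.5) and (5.6) give

\[ p^{|E(H)|} \sum_{F \in \mathcal{F}_H \setminus \{H\}} N(F,H) \Big( \delta_F - \frac{\varepsilon}{k(H)} \Big) - o\big(p^{|E(H)|}\big) \le p^{|E(H)|} \sum_{F \in \mathcal{F}_H \setminus \{H\}} N(F,H)\, \theta^{|E(F)|/\Delta} , \tag{6.2} \]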

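where $\theta := \theta_{b_H}$ is as defined in (5.4). As for $F = H$, replacing $\varepsilon$ by $\varepsilon/k(H)$ in Corollary 6.4 gives

\[ \Big( \delta_H - \frac{\varepsilon}{k(H)} \Big) p^{|E(H)|} - o\big(p^{|E(H)|}\big) = t(H,U) - \frac{\varepsilon}{k(H)}\, p^{|E(H)|} - o\big(p^{|E(H)|}\big) \le 2\theta^{|E(H)|/\Delta}\, \mathbf{1}\{H \text{ bipartite}\}\, p^{|E(H)|} + \theta'^{\,|E(H)|/\Delta}\, p^{|E(H)|} , \tag{6.3} \]

where $\theta' = \theta'_{b_H}$ is defined in (5.4).

Combining (6.2) and (6.3), and revisiting (4.23), it follows that, for $p$ small enough,

\[ P_{H^*}(\theta) + \theta'^{\,|E(H)|/\Delta} \ge 1 + \delta - \varepsilon - o(1) \ge 1 + \delta - 2\varepsilon . \]

Now, recall from Corollary 3.4 that $I_p(p+U) \ge (1 + o(1))(\theta + \theta'/2)\, p^\Delta \log(1/p)$. Hence, for any $\varepsilon > 0$,

\[ \liminf_{p \to 0}\ \inf\Big\{ \frac{I_p(p+U)}{p^\Delta I_p(1)} : t(H, p+U) \ge (1+\delta)\, p^{|E(H)|} \Big\} \ \ge\ \min_{(\theta, \theta') \in A(H)} \big( \theta + \theta'/2 \big) , \]

where $A(H) := \{\theta, \theta' \ge 0 : P_{H^*}(\theta) + \theta'^{\,|E(H)|/\Delta} \ge 1 + \delta - 2\varepsilon\}$. By Observation 4.4, the minimum is attained either at $\theta = 0$ (as per the clique graphon from (2.4)) or $\theta' = 0$ (as per the anti-clique graphon from (2.5)). □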



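6.2. Proof of Proposition 6.2. Let $u, w \in H_A$ be such that $\mathrm{dist}_H(u,w)$, the distance between $u$ and $w$, is 2. We will show

\[ \int_{[0,1]^{|V(H)|}} U(x|H)\, \mathbf{1}\{x_u \in B_b,\ x_w \in \bar B_b\}\, dx(H) \le \varepsilon p^{|E(H)|} , \tag{6.4} \]

for all $u, w \in H_A$ with $\mathrm{dist}_H(u,w) = 2$. Clearly this suffices since the graph is connected.

Observe that $H$ is a disjoint union of $\Delta$ matchings each of size $|H_A| = |E(H)|/\Delta$. By hypothesis there exists a path $u, v, w$ in $H$. Let $M_1$ and $M_2$ be two disjoint matchings of $H$ of size $|E(H)|/\Delta$, which contain the edges $(u,v)$ and $(v,w)$, respectively. Consider the 2-matching $M_{u,w} = M_1 \cup M_2$. Since all the vertices have degree 2 in $M_{u,w}$, it is a disjoint union of cycles $C_{\beta_1}, \dots, C_{\beta_q}$, where $\sum_{j=1}^{q} \beta_j = 2|E(H)|/\Delta$. W.l.o.g. assume that $u, w$ lie on the cycle $C_{\beta_1}$. By Hölder's inequality,

\[ t(C_{\beta_j}, U) \le \big(\mathbb{E}[U^2]\big)^{\beta_j/2} \lesssim p^{\Delta\beta_j/2} \quad \text{for all } 1 \le j \le q . \tag{6.5} \]

As in (5.10), by considering the connected components of $M_{u,w}$, it is easy to show that Proposition 6.2 holds if for any $\varepsilon > 0$, there exists $b_H := p^{\gamma(\varepsilon,H,U)}$ (not depending on $u, w$) such that, with $b = b_H$,

\[ \int_{[0,1]^{\beta_1}} U(x|C_{\beta_1})\, \mathbf{1}\{x_u \in B_b,\ x_w \in \bar B_b\}\, dx(C_{\beta_1}) \le \varepsilon p^{\frac{\Delta |E(C_{\beta_1})|}{2}} \tag{6.6} \]

for any graphon $U$ satisfying (3.10).

The rest of the section is devoted to the proof of (6.6). Label the vertices of $C_{\beta_1}$ with $\{1, 2, \dots, \beta_1\}$ in cyclic order. For $1 \le a \le \beta_1$ and sets $A_1, A_2 \subseteq [0,1]$, define

\[ s_a(A_1, A_2) = \int_{[0,1]^{\beta_1}} U(x|C_{\beta_1})\, \mathbf{1}\{x_a \in A_1,\ x_{a+2} \in A_2\}\, dx(C_{\beta_1}) . \]

Lemma 6.5. Let $U$ be any graphon satisfying (3.10) and $t(C_{\beta_1}, U) = \delta_{\beta_1}\, p^{\Delta\beta_1/2}$. For any $\varepsilon > 0$, there exists $b = b_{u,w}$ such that for all $1 \le a \le \beta_1$,

\[ s_a(B_b, \bar B_b) \le \frac{\varepsilon}{\beta_1}\, p^{\Delta\beta_1/2} + o\big(p^{\Delta\beta_1/2}\big) , \tag{6.7} \]

where the $\varepsilon$ dependence in $o(p^{\Delta\beta_1/2})$ is suppressed.

Proof. Note that by rotational symmetry of the cycle, $s_a(A_1, A_2)$ does not depend on $a$. Therefore, w.l.o.g. assume $a = 1$ and $x_1 \in B_b$.

Let $b', b'' \in [0,1]$. Removing the edges incident to the vertex labeled 1 and by the same argument as in (4.9) (using Hölder's inequality (3.1) and bounds in (3.10), (3.8)), whenever $b'' \ll b'^2$ we get

\[ s_1(B_{b'}, \bar B_{b''}) \ll p^{\Delta\beta_1/2} . \tag{6.8} \]

Fix $\varepsilon > 0$ and $0 < b_1 \ll 1$ and let $M := \beta_1 \lceil C/\varepsilon \rceil$, where $C$ is the constant in (6.5), i.e., $t(C_{\beta_1}, U) \le C p^{\Delta\beta_1/2}$. Consider the following quantities for $a \in \{0, 1, \dots, M\}$:

\[ D_a := s_1\big(B_{b_1^{3^a}}, \bar B_{b_1^{3^a}}\big) - s_1\big(B_{b_1^{3^a}}, \bar B_{b_1^{3^{a+1}}}\big) . \tag{6.9} \]

Since $D_a$ integrates the variable $x_3$ over the set $B_{b_1^{3^{a+1}}} \setminus B_{b_1^{3^a}}$, and these sets are disjoint for $0 \le a \le M$, we get $C p^{\Delta\beta_1/2} \ge t(C_{\beta_1}, U) \ge \sum_{a=0}^{M} D_a$. Therefore, there exists $a_0 \in \{0, 1, \dots, M\}$ such that $D_{a_0} \le \frac{\varepsilon}{\beta_1}\, p^{\Delta\beta_1/2}$.

Let $b_{u,w} = b_1^{3^{a_0}}$. This choice of $b_{u,w}$ together with (6.8) implies that

\[ s_1(B_{b_{u,w}}, \bar B_{b_{u,w}}) = \Big( s_1(B_{b_{u,w}}, \bar B_{b_{u,w}}) - s_1(B_{b_{u,w}}, \bar B_{b_{u,w}^3}) \Big) + s_1(B_{b_{u,w}}, \bar B_{b_{u,w}^3}) \le \frac{\varepsilon}{\beta_1}\, p^{\Delta\beta_1/2} + O\big(p^{\Delta\beta_1/2} \sqrt{b_1}\big) = \frac{\varepsilon}{\beta_1}\, p^{\Delta\beta_1/2} + o\big(p^{\Delta\beta_1/2}\big) . \]
□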

Notice that in the above argument the chosen bu,w depends on the cycle Cn (in particular on the 
vertices u and w) and the graphon U, in contrast with the case of irregular graphs. Recall the left-hand 
side in (5.10) and Sa{Bb,Bb) from (6.7), respectively. Notice that the left-hand side in (5.10) has an 
obvious monotonicity in b whereas the latter does not. Using this monotonicity in the irregular case, the 
choice of b could be made independent of the graph H and the graphon U (cf. the proof of Lemma 5.9). 

In this case, for every pair u,w G Ha, the 2-matching Mu^w and the cycle Cr^ could be distinct. 
Thus, to prove ( 6 . 6 ) and one has to provide additional arguments to show a single degree threshold bn 
works for all such u,w. To this end, we make the following observation which strengthens (6.4). 

Observation 6 . 6 . Given e > 0, define I = [5/eJ (where 5 is as in Theorem 2.2). Then for m > I 
and u,w G Ha as in (6.4) and for any sequence bi,b 2 ,.. .bm p^^^ such that bi^i < for all i G [m], 
there exists a subsequence bj.^ > bj^ > ... > all of which satisfy (6.4). 

Proof. Let $M := \{b_1, b_2, \ldots, b_m\}$. By (6.8) and (6.9), there exists some $b_j$ among $b_1, \ldots, b_m$ which
satisfies (6.4), as $m > \ell$. Consider the largest such term, $b_{j_1}$. Now, look at the set of remaining terms
$\{b_j : j > j_1\}$ and repeat the above argument. Clearly, this argument can be repeated as long as the
length of the remaining sequence is at least $\ell$. Thus, we obtain a sequence $b_{j_1} > b_{j_2} > \cdots$,
each term of which satisfies (6.4). □

Using the above result, the proof of Proposition 6.2 can be completed in the following way: Let
$S_H = \{(u,w) : u,w \in H_\Delta,\ \mathrm{dist}_H(u,w) = 2\}$, the set of pairs of vertices $(u,w)$ in $H_\Delta$ at distance two
from each other. All that is left is to find a $b$ such that (6.4) holds for all $(u,w) \in S_H$. To this end,
take $m = \ell \cdot (|S_H| + 1)$ (where $\ell$ appears in the statement of Observation 6.6) and $b_1, b_2, \ldots, b_m$
satisfying the hypotheses of Observation 6.6. Fix $\{(u_i, w_i) : 1 \le i \le |S_H|\}$, an arbitrary order on $S_H$.

The proof now follows by repeated applications of Observation 6.6. In the first step we apply the
observation to the pair $(u_1, w_1)$ and the sequence $b_1, b_2, \ldots, b_m$, to obtain a subsequence. In the next
step we apply the observation to the pair $(u_2, w_2)$ and the subsequence obtained in the previous step.
We iterate the above over all the elements of $S_H$. Notice that $m$ is chosen to be large enough to ensure
that this iterative procedure can be run over all the elements of $S_H$, and eventually we end with a
subsequence of $b_1, b_2, \ldots, b_m$ of length at least $\ell$. Clearly, by construction, choosing $b$ to be any of these
$\ell$ elements, (6.4) holds for all $(u,w) \in S_H$. □


7. From the continuous to the discrete variational problem 

The following simple lemma connects the discrete and the continuous variational problems: 

Lemma 7.1 ([23, Lemma 2.1]). For any fixed graph $H$ and $p, n, \delta$ we have $\phi(H,p,\delta) \le n^{-2}\phi(H,n,p,\delta)$.

Proof. Given $G_n \in \mathscr{G}_n$, form a graphon $W^{G_n}$ as follows: divide $[0,1]$ into $n$ equal intervals $I_1, I_2, \ldots, I_n$
and set $W^{G_n}(x,y) = a_{ij}$ if $x \in I_i$, $y \in I_j$ and $i \neq j$, and $W^{G_n}(x,y) = p$ otherwise. Clearly we have
$t(H, G_n) \le t(H, W^{G_n})$ and $I_p(W^{G_n}) = n^{-2} I_p(G_n)$ (diagonal entries contribute 0 to $I_p(W^{G_n})$). □
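
The construction in the proof can be checked numerically for $H = K_3$. The following is a minimal sketch, not part of the original argument: it assumes that the weighted graph is given by a symmetric matrix with zero diagonal, that $t(K_3,\cdot)$ is the normalized sum over all ordered triples, and that $I_p(G_n)$ is summed over ordered pairs $i \neq j$ (the normalization in the text may differ by a constant factor). All function names are hypothetical.

```python
import itertools
import math
import random


def random_weighted_graph(n, rng):
    """Symmetric n x n weight matrix with entries in [0, 1) and zero diagonal."""
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            weights[i][j] = weights[j][i] = rng.random()
    return weights


def block_graphon_matrix(weights, p):
    """Values of the step graphon on I_i x I_j: a_ij off the diagonal, p on it."""
    n = len(weights)
    return [[p if i == j else weights[i][j] for j in range(n)] for i in range(n)]


def triangle_density(matrix):
    """t(K_3, .) for an n x n matrix, read as a step function on equal intervals."""
    n = len(matrix)
    total = sum(
        matrix[i][j] * matrix[j][k] * matrix[k][i]
        for i, j, k in itertools.product(range(n), repeat=3)
    )
    return total / n**3


def relative_entropy(x, p):
    """I_p(x) = x log(x/p) + (1-x) log((1-x)/(1-p)), with 0 log 0 = 0."""
    def term(a, b):
        return 0.0 if a == 0 else a * math.log(a / b)
    return term(x, p) + term(1 - x, 1 - p)


def graphon_entropy(matrix, p):
    """Integral of I_p over [0,1]^2 for the step function given by the matrix."""
    n = len(matrix)
    return sum(relative_entropy(x, p) for row in matrix for x in row) / n**2


def discrete_entropy(weights, p):
    """Sum of I_p(a_ij) over ordered pairs i != j (assumed normalization)."""
    n = len(weights)
    return sum(
        relative_entropy(weights[i][j], p)
        for i in range(n) for j in range(n) if i != j
    )


if __name__ == "__main__":
    rng = random.Random(0)
    p = 0.3
    G = random_weighted_graph(6, rng)
    W = block_graphon_matrix(G, p)
    print(triangle_density(G) <= triangle_density(W))
    print(math.isclose(graphon_entropy(W, p), discrete_entropy(G, p) / 6**2))
```

The density can only increase because the diagonal blocks are raised from 0 to $p$, while the entropy is unchanged because $I_p(p) = 0$.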


7.1. Proof of Theorem 1.5. Fix a graph $H$ with maximum degree $\Delta \ge 2$.
Suppose first that $H$ is irregular. By Theorem 2.3 together with Lemma 7.1,

\[
\lim_{n\to\infty} \frac{\phi(H,n,p,\delta)}{n^2 p^\Delta \log(1/p)} \ge \lim_{p\to 0} \frac{\phi(H,p,\delta)}{p^\Delta \log(1/p)} = \theta(H,\delta),
\]

and the anti-clique construction (from §1.3) gives a matching upper bound.
Next, suppose $H$ is regular. By Theorem 2.2 combined with Lemma 7.1,

\[
\lim_{n\to\infty} \frac{\phi(H,n,p,\delta)}{n^2 p^\Delta \log(1/p)} \ge \lim_{p\to 0} \frac{\phi(H,p,\delta)}{p^\Delta \log(1/p)} = \min\left\{\theta(H,\delta),\ \tfrac12 \delta^{2/|V(H)|}\right\},
\]

and a matching upper bound is obtained by either the clique or the anti-clique constructions (from §1.3),
whichever has the lower entropy. □


Remark 7.2. As mentioned below Theorem 1.5, when $H$ is regular and $n^{-2/\Delta} \ll p \ll n^{-1/\Delta}$ (a range
in which the anti-clique construction is no longer applicable), it follows from our arguments that

\[
\lim_{n\to\infty} \frac{\phi(H,n,p,\delta)}{n^2 p^\Delta \log(1/p)} = \tfrac12 \delta^{2/|V(H)|}.
\]

Indeed, the upper bound follows from the discrete clique construction just as before, and it remains to verify the
lower bound, to which end it suffices to consider, thanks to Theorem 2.2 and Lemma 7.1, the case
where $\tfrac12 \delta^{2/|V(H)|} > \theta(H,\delta)$. Assume that (3.5) holds, and then choose $b$ such that $np^\Delta \ll b \ll 1$. Now,
recalling the definition of $B_b$ from (3.3), by (3.7) we have

\[
\lambda(B_b) \le p^\Delta / b \ll n^{-1},
\]

which implies that the number of vertices with degree bigger than $nb$ is 0. Thus, in all the calculations
in §6 one can rule out the anti-clique graphon, and the above result follows.
7.2. Disconnected graphs. The arguments for connected graphs can be easily generalized to
disconnected graphs. By considering the different connected components, the solution of the variational
problem can be expressed as a two-variable constrained optimization problem.

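Corollary 7.3. Let $H$ be a graph with maximum degree $\Delta$ and connected components $H_1, H_2, \ldots, H_s$
for some $s \ge 1$. Define $P_{H_i^*}(x)$ as in Defs. 1.1-1.2. Then for fixed $\delta > 0$,

\[
\lim_{p\to 0} \frac{\phi(H,p,\delta)}{p^\Delta \log(1/p)} = \inf_{\theta,\theta' \ge 0} \left\{ \theta + \tfrac12 \theta' \;:\; \prod_{i=1}^{s} \left( P_{H_i^*}(\theta) + \mathbf{1}_{\{H_i \text{ is } \Delta\text{-regular}\}}\, \theta'^{\,|V(H_i)|/2} \right) = 1 + \delta \right\}.
\]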


For disconnected graphs the solution of the variational problem for large $\delta$ might not be attained by
the clique or the anti-clique graphons, but by a mixture of these two.
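
To make the constraint in Corollary 7.3 concrete, the following minimal sketch computes the polynomial factors by brute force for a triangle and a 2-star. It assumes (consistently with the abstract, but not verified against Defs. 1.1-1.2 here) that $H_i^*$ is the subgraph of $H_i$ induced by the vertices of degree $\Delta$ and that $P$ is the generating function of independent sets by size. All names in the code are hypothetical.

```python
from itertools import combinations


def max_degree_subgraph(vertices, edges, delta):
    """Induced subgraph on the vertices whose degree equals delta (assumed H*)."""
    degree = {v: 0 for v in vertices}
    for u, w in edges:
        degree[u] += 1
        degree[w] += 1
    kept = [v for v in vertices if degree[v] == delta]
    kept_edges = [(u, w) for u, w in edges if u in kept and w in kept]
    return kept, kept_edges


def independence_polynomial(vertices, edges):
    """Coefficients [i(0), i(1), ...], where i(k) counts independent k-sets."""
    vertices = list(vertices)
    edge_set = {frozenset(e) for e in edges}
    coeffs = [0] * (len(vertices) + 1)
    for k in range(len(vertices) + 1):
        for subset in combinations(vertices, k):
            pairs = combinations(subset, 2)
            if all(frozenset(pair) not in edge_set for pair in pairs):
                coeffs[k] += 1
    return coeffs


def evaluate(coeffs, x):
    """Evaluate the polynomial with the given coefficient list at x."""
    return sum(c * x**k for k, c in enumerate(coeffs))


# Two components with overall maximum degree 2.
TRIANGLE = ([0, 1, 2], [(0, 1), (1, 2), (0, 2)])
STAR = (["c", "x", "y"], [("c", "x"), ("c", "y")])
DELTA = 2

P_TRIANGLE = independence_polynomial(*max_degree_subgraph(*TRIANGLE, DELTA))
P_STAR = independence_polynomial(*max_degree_subgraph(*STAR, DELTA))


def constraint_lhs(theta, theta_prime):
    """Product over components; only the triangle is 2-regular, with 3 vertices."""
    triangle_factor = evaluate(P_TRIANGLE, theta) + theta_prime ** (3 / 2)
    star_factor = evaluate(P_STAR, theta)
    return triangle_factor * star_factor


if __name__ == "__main__":
    print(P_TRIANGLE, P_STAR)
    print(constraint_lhs(1.0, 0.0))
```

Under these assumptions the triangle contributes $1 + 3\theta + \theta'^{3/2}$ and the 2-star contributes $1 + \theta$, since only its center has degree 2.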


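Example 7.4. Let $H$ be the disjoint union of a triangle ($K_3$) and a 2-star ($K_{1,2}$). By Corollary 7.3,

\[
\lim_{p\to 0} \frac{\phi(H,p,\delta)}{p^\Delta \log(1/p)}
= \inf_{\theta,\theta' \ge 0} \left\{ \theta + \tfrac12 \theta' \;:\; (1 + 3\theta + \theta'^{3/2})(1 + \theta) = 1 + \delta \right\}
= \inf_{z_1, z_2 \ge 0} \left\{ z_1 + \tfrac12 z_2^2 \;:\; (1 + 3z_1 + z_2^3)(1 + z_1) = 1 + \delta \right\}. \tag{7.1}
\]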


The constraint $(1 + 3z_1 + z_2^3)(1 + z_1) = 1 + \delta$ is a quadratic in $z_1$ with a unique positive solution

\[
z_1 := \tfrac16 \left( -4 - z_2^3 + \sqrt{12\delta + z_2^6 - 4z_2^3 + 16} \right). \tag{7.2}
\]
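
For the reader's convenience, here is a short verification of (7.2), written as a sketch under the assumption $z_2^3 < \delta$ (so that the constant term below is negative); it is not taken from the original text.

```latex
\begin{align*}
(1 + 3z_1 + z_2^3)(1 + z_1) = 1 + \delta
  &\iff 3z_1^2 + (4 + z_2^3)\, z_1 + (z_2^3 - \delta) = 0, \\
\text{discriminant} &= (4 + z_2^3)^2 - 12\,(z_2^3 - \delta)
   = z_2^6 - 4 z_2^3 + 12\delta + 16, \\
z_1 &= \frac{-(4 + z_2^3) \pm \sqrt{z_2^6 - 4 z_2^3 + 12\delta + 16}}{6}.
\end{align*}
% The product of the two roots is (z_2^3 - \delta)/3 < 0 when z_2^3 < \delta,
% so exactly one root is positive: the one with the plus sign, which is (7.2).
```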

From (7.1) it follows that 

The clique corresponds to $z_1 = 0$, so $z_2 = \delta^{1/3}$, and its objective value is $\tfrac12 \delta^{2/3}$.
The anti-clique corresponds to $z_2 = 0$, so by (7.2) its objective value is $\sim \sqrt{\delta/3}$ for large $\delta$. Thus, for
large 5 both solutions are inferior to (^V^,which gives the objective value + 0(55). 
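
The comparison in Example 7.4 can be illustrated numerically. The sketch below is a crude grid search, not the argument of the paper: it eliminates $z_1$ through the positive root of the quadratic constraint $(1 + 3z_1 + z_2^3)(1 + z_1) = 1 + \delta$ and minimizes $z_1 + \tfrac12 z_2^2$ over $z_2 \in [0, \delta^{1/3}]$. The function names, the grid size, and the sample value of $\delta$ are all assumptions made for illustration.

```python
import math


def z1_from_z2(z2, delta):
    """Positive root in z1 of (1 + 3*z1 + z2**3) * (1 + z1) = 1 + delta."""
    c = z2**3
    return (-4 - c + math.sqrt(12 * delta + c * c - 4 * c + 16)) / 6


def objective(z2, delta):
    """Objective z1 + z2^2 / 2 along the constraint curve."""
    return z1_from_z2(z2, delta) + 0.5 * z2**2


def clique_value(delta):
    """z1 = 0, so z2^3 = delta."""
    return 0.5 * delta ** (2 / 3)


def anti_clique_value(delta):
    """z2 = 0, so the objective is just the positive root z1."""
    return z1_from_z2(0.0, delta)


def best_on_grid(delta, steps=20000):
    """Return (value, z2) minimizing the objective on a uniform grid in z2."""
    z2_max = delta ** (1 / 3)
    grid = (z2_max * i / steps for i in range(steps + 1))
    return min((objective(z2, delta), z2) for z2 in grid)


if __name__ == "__main__":
    delta = 1e6  # an arbitrary large value chosen for illustration
    value, z2 = best_on_grid(delta)
    print("clique:     ", clique_value(delta))
    print("anti-clique:", anti_clique_value(delta))
    print("grid best:  ", value, "at z2 =", z2)
```

For this sample $\delta$ the grid minimum is attained at an interior point with both $z_1, z_2 > 0$ and lies strictly below both pure constructions, in line with the claim that a mixture wins for large $\delta$.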